JunHowie
JunHowie
AI & ML interests
None yet
Recent Activity
new activity
about 15 hours ago
QuantTrio/sarvam-105b-AWQ:Do you take quant requests? updated
a model 1 day ago
QuantTrio/sarvam-105b-AWQ published
a model 1 day ago
QuantTrio/sarvam-105b-AWQ Organizations
Do you take quant requests?
1
#1 opened about 22 hours ago
by
pathosethoslogos
--max-model-len 32768 seems a bit too small for agent use cases ?
3
#3 opened 3 days ago
by
edwarddukewu
AWQ
π€ 3
1
#3 opened about 1 month ago
by
darkstar3537
Great work
5
#1 opened 15 days ago
by
JoeyHwong
Qwen3.5-397B-A17B-AWQ vs Qwen3.5-122B-A10B
2
#2 opened 17 days ago
by
zuuky
Kimi-K2.5-E192 ?
1
#2 opened 27 days ago
by
Rebis
Qwen3.5 AWQ 4 Bit
2
#1 opened 22 days ago
by
yuchenxie
Qwen3.5 AWQ
1
#3 opened 25 days ago
by
timroethig
MiniMax-M2.5-AWQ please
π₯ 1
3
#3 opened 29 days ago
by
olka-fi
After deploying locally, I keep encountering errors when running the examples. Is there any solution
1
#1 opened about 2 months ago
by
AndyLeaf666
Once again Thanks, here is my review for 8 x RTX 5090 setup
17
#2 opened 3 months ago
by
crystech
The model startup using vllm failed.
10
#5 opened 2 months ago
by
beausoft
Thank you for your Quant!
π 3
7
#1 opened 3 months ago
by
mtcl
This model is awesome!
π π 2
3
#4 opened 3 months ago
by
crystech
[request] DeepSeek-V3.1-Terminus
4
#3 opened 3 months ago
by
willfalco
Aww Man!
20
#1 opened 3 months ago
by
mtcl
ooof this fits in 4x96gb can we get this for the new 3.2 Speciale ase well please :)
16
#2 opened 3 months ago
by
Fernanda24
Will it support Ampere GPU?
2
#1 opened 5 months ago
by
swearofwind
vllm==0.10.0 load error
1
#1 opened 3 months ago
by
janezhou
AWQ quant for the 162B REAP distill?
4
#5 opened 4 months ago
by
BW-Projects