JunHowie
JunHowie
AI & ML interests
None yet
Recent Activity
new activity about 19 hours ago
QuantTrio/GLM-5-AWQ:vllm部署失败 new activity 2 days ago
QuantTrio/sarvam-105b-AWQ:Do you take quant requests? new activity 2 days ago
QuantTrio/Qwen3.5-9B-AWQ:why cuda12.8 needed?Organizations
vllm部署失败
5
#3 opened 3 days ago
by
Yuxin362
Do you take quant requests?
1
#1 opened 6 days ago
by
pathosethoslogos
why cuda12.8 needed?
1
#1 opened 2 days ago
by
justplus
--max-model-len 32768 seems a bit too small for agent use cases ?
3
#3 opened 8 days ago
by
edwarddukewu
AWQ
🤝 3
1
#3 opened about 2 months ago
by
darkstar3537
Great work
5
#1 opened 20 days ago
by
JoeyHwong
Qwen3.5-397B-A17B-AWQ vs Qwen3.5-122B-A10B
2
#2 opened 22 days ago
by
zuuky
Kimi-K2.5-E192 ?
1
#2 opened about 1 month ago
by
Rebis
Qwen3.5 AWQ 4 Bit
2
#1 opened 27 days ago
by
yuchenxie
Qwen3.5 AWQ
1
#3 opened about 1 month ago
by
timroethig
MiniMax-M2.5-AWQ please
🔥 1
3
#3 opened about 1 month ago
by
olka-fi
After deploying locally, I keep encountering errors when running the examples. Is there any solution
1
#1 opened about 2 months ago
by
AndyLeaf666
Once again Thanks, here is my review for 8 x RTX 5090 setup
17
#2 opened 3 months ago
by
crystech
The model startup using vllm failed.
10
#5 opened 3 months ago
by
beausoft
Thank you for your Quant!
👀 3
7
#1 opened 3 months ago
by
mtcl
This model is awesome!
👀🚀 2
3
#4 opened 3 months ago
by
crystech
[request] DeepSeek-V3.1-Terminus
4
#3 opened 3 months ago
by
willfalco
Aww Man!
20
#1 opened 4 months ago
by
mtcl
ooof this fits in 4x96gb can we get this for the new 3.2 Speciale ase well please :)
16
#2 opened 4 months ago
by
Fernanda24
Will it support Ampere GPU?
2
#1 opened 6 months ago
by
swearofwind