What key issues have been encountered in GGUF conversion?
#12 opened 15 days ago
by
NKLAR5

batched inference with video input
#11 opened about 1 month ago
by
vexilligera
Looking Forward to models in GPTQModel formats like W4A16 and W8A16
#10 opened about 2 months ago
by
X-SZM
afs
#8 opened 2 months ago
by
Marc-Anthony

Bitsandbytesconfig 4bit possible?
#6 opened 2 months ago
by
Day1Kim
zai-org/GLM-4.5V not working in sglang please help. I have 8xh100
#5 opened 2 months ago
by
dahwinsingularity

LoRA adapter?
#3 opened 2 months ago
by
lightenup
A look into the future: Wishlist for GLM-5
👍
11
4
#2 opened 2 months ago
by
Dampfinchen
Text performance compared to GLM-4.5 Air
👀
5
2
#1 opened 2 months ago
by
Dampfinchen