Request: Dynamic 2.0 Quants for EuroMoE-2.6B-A0.6B-Instruct-Preview
#11 opened 12 days ago by fernandoruiz
vLLM + openai/gpt-oss-20b on 3× RTX 3090 (CUDA 12.8) — FlashAttention Error
#10 opened 20 days ago by robinhassan

I can't run any of the dynamic bnb-4bit quants with TextGenerationInference
2
#6 opened 7 months ago by v3ss0n
Best AI Models for Spanish
1
#3 opened 10 months ago by orionsoftware333
Any chance you could do something like a phi-3.5 vision?
3
#2 opened 12 months ago by Nick103
RTX 4070 Laptop
1
#1 opened about 1 year ago by muhammad-albasha