Edit Models filters
Apps
Inference Providers
Active filters: quantization
groxaxo/Qwen3-4B-Instruct-2507-heretic-W4A16
Text Generation • 0.9B • Updated
• 9
groxaxo/Qwen3-4B-Instruct-2507-heretic-W8A16
Text Generation • 1B • Updated
• 6
namgyu-youn/EXAONE-4.0-1.2B-LLMC-AWQ-W4
0.6B • Updated
• 69
oshkorinova/MamayLM-Gemma-3-12B-IT-v1.0-FP8-Dynamic
Text Generation • 12B • Updated
• 16
EricRollei/HunyuanImage-3-INT8-v2
Text-to-Image • 83B • Updated
• 34
EricRollei/HunyuanImage-3-NF4-v2
Text-to-Image • 83B • Updated
• 47
EricRollei/HunyuanImage-3.0-Instruct-INT8-v2
Text-to-Image • 83B • Updated
• 9
EricRollei/HunyuanImage-3.0-Instruct-NF4-v2
Text-to-Image • 83B • Updated
• 18
AlaminI/nllb-200-600M-ct2-float16
Translation • Updated
• 18
AlaminI/nllb-200-600M-ct2-int8
Translation • Updated
• 8
Jakubrd4/Bielik-11B-v2.3-Instruct-QuIP-2bit
Text Generation • 0.6B • Updated
• 18
AlaminI/nllb-200-600M-nf4-custom-weights-bare-metal
Translation • Updated
• 15
MO7YW4NG/ms-marco-MiniLM-L-6-v2-4bit-nf4
groxaxo/qwen3-embed-8b-gptq
Feature Extraction • 2B • Updated
• 15
ApacheOne/Qwen2.5-VL-7B-Instruct-abliterated-nvfp4
Updated
• 115
JongYeop/Llama-3.1-8B-Instruct-MXFP4-W4A4
Text Generation • 5B • Updated
• 70
JongYeop/Qwen2.5-7B-Instruct-MXFP4-W4A4
Text Generation • 5B • Updated
• 19
jameshhugg/OptiMind-SFT-Q4_K_M-GGUF
21B • Updated
• 36
oshkorinova/MamayLM-Gemma-3-12B-IT-v1.0-FP8-Static-Ultrachat-200k
Text Generation • 12B • Updated
• 18
AlaminI/nllb-200-600M-int8-custom-weights-bare-metal
Translation • Updated
• 15