Models: 831

Active filters: quantization

Ne7/LTX2-Rapid-Merges-GGUF

Image-Text-to-Video • 19B • Updated 9 days ago • 277

groxaxo/Qwen3-4B-Instruct-2507-heretic-W4A16

Text Generation • 0.9B • Updated 8 days ago • 9

groxaxo/Qwen3-4B-Instruct-2507-heretic-W8A16

Text Generation • 1B • Updated 8 days ago • 6

namgyu-youn/EXAONE-4.0-1.2B-LLMC-AWQ-W4

0.6B • Updated 8 days ago • 69

oshkorinova/MamayLM-Gemma-3-12B-IT-v1.0-FP8-Dynamic

Text Generation • 12B • Updated about 15 hours ago • 16

EricRollei/HunyuanImage-3-INT8-v2

Text-to-Image • 83B • Updated 5 days ago • 34

EricRollei/HunyuanImage-3-NF4-v2

Text-to-Image • 83B • Updated 5 days ago • 47

EricRollei/HunyuanImage-3.0-Instruct-INT8-v2

Text-to-Image • 83B • Updated 4 days ago • 9

EricRollei/HunyuanImage-3.0-Instruct-NF4-v2

Text-to-Image • 83B • Updated 4 days ago • 18

AlaminI/nllb-200-600M-ct2-float16

Translation • Updated 4 days ago • 18

AlaminI/nllb-200-600M-ct2-int8

Translation • Updated 4 days ago • 8

Jakubrd4/Bielik-11B-v2.3-Instruct-QuIP-2bit

Text Generation • 0.6B • Updated 4 days ago • 18

AlaminI/nllb-200-600M-nf4-custom-weights-bare-metal

Translation • Updated 2 days ago • 15

MO7YW4NG/ms-marco-MiniLM-L-6-v2-4bit-nf4

Text Ranking • 23.1M • Updated 1 day ago • 27

groxaxo/qwen3-embed-8b-gptq

Feature Extraction • 2B • Updated 3 days ago • 15

ApacheOne/Qwen2.5-VL-7B-Instruct-abliterated-nvfp4

Updated 1 day ago • 115

JongYeop/Llama-3.1-8B-Instruct-MXFP4-W4A4

Text Generation • 5B • Updated 2 days ago • 70

JongYeop/Qwen2.5-7B-Instruct-MXFP4-W4A4

Text Generation • 5B • Updated 2 days ago • 19

jameshhugg/OptiMind-SFT-Q4_K_M-GGUF

21B • Updated 1 day ago • 36

oshkorinova/MamayLM-Gemma-3-12B-IT-v1.0-FP8-Static-Ultrachat-200k

Text Generation • 12B • Updated about 15 hours ago • 18

AlaminI/nllb-200-600M-int8-custom-weights-bare-metal

Translation • Updated about 10 hours ago • 15