Active filters: 4-bit
wikeeyang/Emu35-Image-NF4 • 35B • 456 • 10
Jalea96/DeepSeek-OCR-bnb-4bit-NF4 • Image-Text-to-Text • 3B • 3.7k • 9
sanchezalonsodavid17/DeepSeek-OCR-MBQ-Quantized-v1 • Image-Text-to-Text • 3B • 261 • 4
wikeeyang/Emu35-NF4 • 35B • 46 • 4
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ • Text Generation • 31B • 101k • 28
mlx-community/DeepSeek-OCR-4bit • Image-Text-to-Text • 0.8B • 1.99k • 5
mlx-community/Kimi-Linear-48B-A3B-Instruct-4bit • Text Generation • 49B • 988 • 3
unsloth/Phi-3-mini-4k-instruct-bnb-4bit • Text Generation • 2B • 48.6k • 38
Qwen/Qwen2.5-7B-Instruct-AWQ • Text Generation • 2B • 609k • 32
numen-tech/Llama-3.3-70B-Instruct-abliterated-w4a16g128sym • Text Generation • 2
unsloth/Qwen3-8B-bnb-4bit • 5B • 502k • 5
Qwen/Qwen2.5-Omni-7B-AWQ • Any-to-Any • 5B • 18k • 13
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-AWQ • Text Generation • 66B • 1.45k • 7
inferencerlabs/Kimi-K2-Instruct-MLX-3.985bit • Text Generation • 1T • 281 • 7
dousery/medical-reasoning-gpt-oss-20b • Text Generation • 21B • 4.98k • 44
mlx-community/gpt-oss-20b-MXFP4-Q4 • Text Generation • 21B • 945 • 7
mlx-community/gpt-oss-20b-MXFP4-Q8 • Text Generation • 21B • 832k • 16
unsloth/Qwen3-VL-8B-Instruct-unsloth-bnb-4bit • Image-Text-to-Text • 9B • 32.6k • 9
Disty0/Wan2.2-I2V-A14B-SDNQ-uint4-svd-r32 • 101 • 2
pherber3/Qwen3-Omni-30B-A3B-Instruct-4bit-mlx • 31B • 217 • 3
TheBloke/vicuna-7B-v1.5-GPTQ • Text Generation • 1B • 86 • 16
TheBloke/Mistral-7B-v0.1-AWQ • Text Generation • 1B • 880 • 33
TheBloke/LLaMA-Pro-8B-Instruct-AWQ • Text Generation • 1B • 218 • 2
herisan/tinyllama-bnb-4bit_mental_health_counseling_conversations • Text Generation • 0.6B • 1
Qwen/Qwen1.5-72B-Chat-AWQ • Text Generation • 12B • 1.31k • 25
nateraw/defog-sqlcoder-70b-alpha-awq • Text Generation • 10B • 5 • 2
Intel/phi-2-int4-inc • Text Generation • 0.6B • 15 • 4
MaziyarPanahi/WizardLM-2-7B-GGUF • Text Generation • 7B • 87.4k • 83
MaziyarPanahi/Meta-Llama-3-70B-Instruct-GGUF • Text Generation • 71B • 2.73k • 170
chansung/mental_health_counseling_v0.1