-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
microsoft/bitnet-b1.58-2B-4T
Text Generation
•
Updated
•
16.4k
•
1.05k
lmstudio-community/DeepSeek-R1-0528-Qwen3-8B-MLX-8bit
Text Generation
•
Updated
•
339k
•
4
HF1BitLLM/Llama3-8B-1.58-100B-tokens
Text Generation
•
Updated
•
2.43k
•
186
speakleash/Bielik-11B-v2.6-Instruct-MLX-8bit
Text Generation
•
Updated
•
39
•
2
Qwen/Qwen1.5-4B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
32
•
6
MaziyarPanahi/ChatMusician-GGUF
Text Generation
•
Updated
•
307
•
13
MaziyarPanahi/WizardLM-2-8x22B-GGUF
Text Generation
•
Updated
•
1.57k
•
128
MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-GGUF
Text Generation
•
Updated
•
1.29k
•
33
atcsecure/dolphin-2.9-llama3-70b-8.0bpw-h8-exl2
Text Generation
•
Updated
•
25
•
2
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
2.13k
•
25
brunopio/Llama3-8B-1.58-100B-tokens-GGUF
Text Generation
•
Updated
•
2.28k
•
16
MaziyarPanahi/ruadapt_qwen2.5_3B_ext_u48_instruct_v4-GGUF
Text Generation
•
Updated
•
48
•
1
PrunaAI/PJMixers-Dev-Qwen2.5-RomboTiesTest-7B-bnb-8bit-smashed
Updated
•
29
•
1
LucidityAI/pico-mini-v1-.5b
Text Generation
•
Updated
•
18
•
1
RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8
Image-Text-to-Text
•
Updated
•
1.38k
•
5
Emilio407/nllb-200-distilled-600M-8bit
Translation
•
Updated
•
170
•
1
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8
Image-Text-to-Text
•
Updated
•
9.47k
•
5
tiiuae/Falcon-E-3B-Instruct
Text Generation
•
Updated
•
1.33k
•
29
mlx-community/Qwen3-30B-A3B-8bit
Text Generation
•
Updated
•
1.4k
•
6
premkumarkora/qwen3-shakespeare-final
Text Generation
•
Updated
•
8
•
1
mlx-community/Devstral-Small-2505-8bit
Text Generation
•
Updated
•
988
•
1
bullerwins/DeepSeek-R1-0528-Qwen3-8B-exl3-8.0bpw
Text Generation
•
Updated
•
43
•
1
OrionCAF/qwen2_5_turkish_vlm
Updated
•
140
•
3
Disya/DeepSeek-R1-0528-Qwen3-8B-exl2-8bpw-h8
Text Generation
•
Updated
•
16
•
1
Cosmobillian/qwen2_5-turkish-vlm
Updated
•
79
•
1
mlx-community/Nemotron-Research-Reasoning-Qwen-1.5B-8bit
Updated
•
108
•
1
RedHatAI/gemma-3-27b-it-quantized.w8a8
Image-Text-to-Text
•
Updated
•
6
•
1
mlx-community/plamo-2-translate-8bit
Text Generation
•
Updated
•
19
•
1
echarlaix/distilbert-base-uncased-finetuned-sst-2-english-int8-dynamic
Text Classification
•
Updated
•
1.72k
•
1
Intel/distilbert-base-uncased-finetuned-sst-2-english-int8-dynamic-inc
Text Classification
•
Updated
•
15
•
1