-
-
-
-
-
-
Inference Providers
Active filters:
gptq
AIDC-AI/Ovis2-16B-GPTQ-Int4
Image-Text-to-Text
•
5B
•
Updated
•
1.36k
•
5
numind/NuExtract-2.0-8B-GPTQ
Image-Text-to-Text
•
3B
•
Updated
•
393
•
3
TheBloke/WizardCoder-15B-1.0-GPTQ
Text Generation
•
3B
•
Updated
•
997
•
178
TheBloke/Wizard-Vicuna-13B-Uncensored-SuperHOT-8K-GPTQ
Text Generation
•
2B
•
Updated
•
980
•
149
TheBloke/Nous-Hermes-Llama2-GPTQ
Text Generation
•
2B
•
Updated
•
886
•
60
Qwen/Qwen-7B-Chat-Int4
Text Generation
•
2B
•
Updated
•
2.09k
•
74
Qwen/Qwen-VL-Chat-Int4
Text Generation
•
4B
•
Updated
•
3.05k
•
92
TheBloke/LLaMA2-13B-Tiefighter-GPTQ
Text Generation
•
2B
•
Updated
•
31
•
34
TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ
Text Generation
•
6B
•
Updated
•
66.8k
•
138
TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
Text Generation
•
1B
•
Updated
•
74.8k
•
53
TheBloke/Kunoichi-7B-GPTQ
Text Generation
•
1B
•
Updated
•
27
•
16
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
1.14k
•
46
Qwen/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
2.91k
•
28
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4
Text Generation
•
2B
•
Updated
•
25.5k
•
27
shuyuej/Mistral-Nemo-Instruct-2407-GPTQ-INT8
4B
•
Updated
•
156
•
3
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int4
Text Generation
•
6B
•
Updated
•
156k
•
36
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int8
Text Generation
•
10B
•
Updated
•
81.9k
•
11
xmadai/Mistral-Large-Instruct-2407-xMADai-INT4
Text Generation
•
17B
•
Updated
•
17
•
7
AIDC-AI/Ovis2-8B-GPTQ-Int4
Image-Text-to-Text
•
3B
•
Updated
•
3.32k
•
3
kaitchup/Qwen3-0.6B-autoround-4bit-gptq
0.2B
•
Updated
•
30
•
2
Qwen/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
•
5B
•
Updated
•
27.9k
•
18
Qwen/Qwen3-235B-A22B-GPTQ-Int4
Text Generation
•
Updated
•
6.4k
•
20
Intel/DeepSeek-R1-0528-Qwen3-8B-int4-AutoRound-gptq-inc
2B
•
Updated
•
11.6k
•
3
numind/NuExtract-2.0-4B-GPTQ
Image-Text-to-Text
•
1B
•
Updated
•
181
•
2
ramgpt/jan-nano-4b-gptqmodel-4bit
Text Generation
•
0.9B
•
Updated
•
261
•
2
tencent/Hunyuan-A13B-Instruct-GPTQ-Int4
Text Generation
•
11B
•
Updated
•
103k
•
47
LGAI-EXAONE/EXAONE-4.0-1.2B-GPTQ-Int8
Text Generation
•
0.5B
•
Updated
•
120
•
7
elinas/alpaca-13b-lora-int4
Text Generation
•
Updated
•
8
•
41
elinas/alpaca-30b-lora-int4
Text Generation
•
Updated
•
11
•
68
mayaeary/pygmalion-6b-4bit-128g
Text Generation
•
Updated
•
43
•
40