Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

23,264

Full-text search

Active filters: llama-cpp

martintomov/mathstral-7B-v0.1-Q4_K_M-GGUF

7B • Updated Jul 16, 2024 • 47 • 1

martintomov/mathstral-7B-v0.1-Q8_0-GGUF

7B • Updated Jul 16, 2024 • 1

martintomov/mathstral-7B-v0.1-Q2_K-GGUF

7B • Updated Jul 16, 2024 • 38

julioc-p/Gemma2BFullPrecisionDifferentChatTemplate-Q4_K_M-GGUF

3B • Updated Jul 16, 2024 • 2

martintomov/gpt2_1558M_final3_hf-Q4_K_M-GGUF

2B • Updated Jul 16, 2024 • 1

Srinath-Pulaverthi/Arithmo2-Mistral-7B-Q5_K_M-GGUF

7B • Updated Jul 16, 2024 • 7 • 1

Kurgan1138/L3-8B-Celeste-v1-Q6_K-GGUF

8B • Updated Jul 16, 2024 • 1

exocet25/mathstral-7B-v0.1-Q4_K_M-GGUF

7B • Updated Jul 16, 2024 • 1

Srinath-Pulaverthi/llemma_7b-Q4_K_M-GGUF

7B • Updated Jul 17, 2024 • 3 • 1

Ransss/L3-15B-EtherealMaid-t0.0001-Q6_K-GGUF

15B • Updated Jul 16, 2024 • 1

NikolayKozloff/mathstral-7B-v0.1-Q8_0-GGUF

7B • Updated Jul 16, 2024 • 1 • 1

Ransss/L3-SthenoMaidBlackroot-15B-Q6_K-GGUF

15B • Updated Jul 16, 2024 • 1

NikolayKozloff/Lite-Mistral-150M-v2-Instruct-Q8_0-GGUF

0.2B • Updated Jul 16, 2024 • 1 • 1

mbahrsnc/mini-mcqueen-1.1b-Q4_K_M-GGUF

1B • Updated Jul 17, 2024 • 5

Kurgan1138/L3-8b-Rosier-v1-Q6_K-GGUF

8B • Updated Jul 16, 2024 • 1

Marlon81/Phi-3-mini-4k-instruct-Q5_K_M-GGUF

Text Generation • 4B • Updated Jul 16, 2024 • 17

pegasus912/Gemma-Radiation-RP-9B-Q5_K_M-GGUF

9B • Updated Jul 16, 2024 • 1 • 1

dnsch/Gemma2BFullPrecision-Q4_K_M-GGUF

3B • Updated Jul 16, 2024 • 3

dnsch/Gemma2BFullPrecision-Q8_0-GGUF

3B • Updated Jul 16, 2024 • 8

reach-vb/mathstral-7B-v0.1-Q8_0-GGUF

7B • Updated Jul 16, 2024 • 7 • 1

dnsch/Gemma2BFullPrecision-Q6_K-GGUF

3B • Updated Jul 16, 2024 • 8

anosh-rezaei/Gemma2BFullPrecision-Q8_0-GGUF

3B • Updated Jul 16, 2024 • 4

EnsonWu/Llama3-TAIDE-LX-8B-Chat-Alpha1-Q4_K_M-GGUF

8B • Updated Jul 16, 2024 • 7

tburger87/AutoCoder_S_6.7B-Q8_0-GGUF

7B • Updated Jul 16, 2024

catbash/Mistral-7B-v0.1-flashback-v2-instruct-Q5_K_M-GGUF

Text Generation • 7B • Updated Jul 17, 2024 • 14 • 2

mbahrsnc/mini-mcqueen-1.1b-IQ4_NL-GGUF

1B • Updated Jul 17, 2024 • 2

joshnader/mathstral-7B-v0.1-Q4_K_M-GGUF

7B • Updated Jul 17, 2024 • 7

joshnader/mathstral-7B-v0.1-Q8_0-GGUF

7B • Updated Jul 17, 2024

SkyNotion/gpt2-Q4_K_M-GGUF

0.2B • Updated Jul 17, 2024 • 4

amitj1jan/Meta-Llama-3-8B-Q3_K_M-GGUF

Text Generation • 8B • Updated Jul 17, 2024