EmbeddingGemma 300M ggml-org/embeddinggemma-300M-GGUF 0.3B • Updated 11 days ago • 1.6k • 8 ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated about 13 hours ago • 558 • 2 ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated about 13 hours ago • 447 • 2
ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated about 13 hours ago • 558 • 2
ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated about 13 hours ago • 447 • 2
Gemma 3-270m Collection of models for Gemma 3-270m ggml-org/gemma-3-270m-GGUF 0.3B • Updated Aug 14 • 6.83k • 14 ggml-org/gemma-3-270m-it-GGUF 0.3B • Updated Aug 15 • 2.94k • 15 ggml-org/gemma-3-270m-qat-GGUF 0.3B • Updated Aug 14 • 763 • 6 ggml-org/gemma-3-270m-it-qat-GGUF 0.3B • Updated Aug 15 • 968 • 10
Qwen 2 VL and Qwen 2.5 VL ggml-org/Qwen2.5-VL-3B-Instruct-GGUF 3B • Updated Apr 30 • 4.35k • 5 ggml-org/Qwen2.5-VL-7B-Instruct-GGUF 8B • Updated Apr 30 • 7.9k • 8 ggml-org/Qwen2.5-VL-32B-Instruct-GGUF 33B • Updated May 15 • 574 • 3 ggml-org/Qwen2-VL-2B-Instruct-GGUF 2B • Updated Apr 30 • 918 • 2
SmolVLM GGUF ggml-org/SmolVLM2-2.2B-Instruct-GGUF 2B • Updated Apr 30 • 7.24k • 20 ggml-org/SmolVLM2-500M-Video-Instruct-GGUF 0.4B • Updated Apr 30 • 2.16k • 12 ggml-org/SmolVLM2-256M-Video-Instruct-GGUF 0.2B • Updated Apr 30 • 673 • 6 ggml-org/SmolVLM-Instruct-GGUF 2B • Updated Apr 30 • 327 • 6
llama.cpp presets Models that are used for presets in llama.cpp. ggml-org/gte-small-Q8_0-GGUF Sentence Similarity • 0.0B • Updated Feb 6 • 115 • 2 ggml-org/bge-small-en-v1.5-Q8_0-GGUF Feature Extraction • 0.0B • Updated Feb 6 • 223 • 3 ggml-org/e5-small-v2-Q8_0-GGUF Sentence Similarity • 0.0B • Updated Feb 6 • 38
llama.vim Recommended models for the llama.vim and llama.vscode plugins ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF Text Generation • 0.5B • Updated Jan 31 • 720 • 6 ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF Text Generation • 2B • Updated Oct 28, 2024 • 6.44k • 9 ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF Text Generation • 3B • Updated Nov 26, 2024 • 1.89k • 5 ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF Text Generation • 8B • Updated Oct 28, 2024 • 2.28k • 5
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF Text Generation • 2B • Updated Oct 28, 2024 • 6.44k • 9
Multimodal GGUFs Vision and audio models compatible with llama-server and llama-mtmd-cli Gemma 3 Collection 10 items • Updated 19 days ago • 19 Kimi-VL Collection 2 items • Updated 26 days ago ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF Image-Text-to-Text • 24B • Updated May 1 • 419 • 4 InternVL 3 and InternVL 2.5 Collection 10 items • Updated 26 days ago
ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF Image-Text-to-Text • 24B • Updated May 1 • 419 • 4
GPT OSS ggml-org/gpt-oss-120b-GGUF 117B • Updated 25 days ago • 19.5k • 22 ggml-org/gpt-oss-20b-GGUF 21B • Updated 25 days ago • 130k • 75
Gemma 3n ggml-org/gemma-3n-E2B-it-GGUF 4B • Updated 24 days ago • 2.14k • 16 ggml-org/gemma-3n-E4B-it-GGUF 7B • Updated Jun 26 • 1.8k • 17
InternVL 3 and InternVL 2.5 ggml-org/InternVL3-1B-Instruct-GGUF 0.6B • Updated May 10 • 548 • 4 ggml-org/InternVL3-2B-Instruct-GGUF 2B • Updated May 10 • 385 • 5 ggml-org/InternVL3-8B-Instruct-GGUF 8B • Updated May 10 • 462 • 5 ggml-org/InternVL3-14B-Instruct-GGUF 15B • Updated May 10 • 525 • 4
Qwen 3 ggml-org/Qwen3-0.6B-GGUF 0.8B • Updated Apr 28 • 1k • 5 ggml-org/Qwen3-1.7B-GGUF 2B • Updated Apr 28 • 1.34k • 4 ggml-org/Qwen3-4B-GGUF 4B • Updated Apr 28 • 452 • 1 ggml-org/Qwen3-8B-GGUF 8B • Updated Apr 28 • 823 • 3
Gemma 3 ggml-org/gemma-3-270m-it-GGUF 0.3B • Updated Aug 15 • 2.94k • 15 ggml-org/gemma-3-1b-it-GGUF 1.0B • Updated Mar 12 • 11k • 19 ggml-org/gemma-3-4b-it-GGUF Image-Text-to-Text • 4B • Updated May 21 • 25.5k • 40 ggml-org/gemma-3-12b-it-GGUF Image-Text-to-Text • 12B • Updated May 21 • 5.27k • 27
GGUF LoRA adapters Adapters extracted from fine tuned models, using mergekit-extract-lora ggml-org/LoRA-Llama-3-Instruct-abliteration-8B-F16-GGUF 0.1B • Updated Nov 1, 2024 • 36 ggml-org/LoRA-Qwen2.5-1.5B-Instruct-abliterated-F16-GGUF 0.1B • Updated Jan 23 • 26 • 2 ggml-org/LoRA-Qwen2.5-3B-Instruct-abliterated-F16-GGUF 0.1B • Updated Jan 9 • 24 • 1 ggml-org/LoRA-Qwen2.5-7B-Instruct-abliterated-v3-F16-GGUF 0.1B • Updated Jan 8 • 29 • 3
Gemma 1.1 GGUFs ggml-org/gemma-1.1-2b-it-Q8_0-GGUF 3B • Updated Apr 5, 2024 • 1.52k • 1 ggml-org/gemma-1.1-7b-it-Q8_0-GGUF 9B • Updated Apr 5, 2024 • 26 ggml-org/gemma-1.1-7b-it-Q4_K_M-GGUF 9B • Updated Apr 5, 2024 • 1.22k • 4
EmbeddingGemma 300M ggml-org/embeddinggemma-300M-GGUF 0.3B • Updated 11 days ago • 1.6k • 8 ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated about 13 hours ago • 558 • 2 ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated about 13 hours ago • 447 • 2
ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated about 13 hours ago • 558 • 2
ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated about 13 hours ago • 447 • 2
Multimodal GGUFs Vision and audio models compatible with llama-server and llama-mtmd-cli Gemma 3 Collection 10 items • Updated 19 days ago • 19 Kimi-VL Collection 2 items • Updated 26 days ago ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF Image-Text-to-Text • 24B • Updated May 1 • 419 • 4 InternVL 3 and InternVL 2.5 Collection 10 items • Updated 26 days ago
ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF Image-Text-to-Text • 24B • Updated May 1 • 419 • 4
Gemma 3-270m Collection of models for Gemma 3-270m ggml-org/gemma-3-270m-GGUF 0.3B • Updated Aug 14 • 6.83k • 14 ggml-org/gemma-3-270m-it-GGUF 0.3B • Updated Aug 15 • 2.94k • 15 ggml-org/gemma-3-270m-qat-GGUF 0.3B • Updated Aug 14 • 763 • 6 ggml-org/gemma-3-270m-it-qat-GGUF 0.3B • Updated Aug 15 • 968 • 10
GPT OSS ggml-org/gpt-oss-120b-GGUF 117B • Updated 25 days ago • 19.5k • 22 ggml-org/gpt-oss-20b-GGUF 21B • Updated 25 days ago • 130k • 75
Gemma 3n ggml-org/gemma-3n-E2B-it-GGUF 4B • Updated 24 days ago • 2.14k • 16 ggml-org/gemma-3n-E4B-it-GGUF 7B • Updated Jun 26 • 1.8k • 17
InternVL 3 and InternVL 2.5 ggml-org/InternVL3-1B-Instruct-GGUF 0.6B • Updated May 10 • 548 • 4 ggml-org/InternVL3-2B-Instruct-GGUF 2B • Updated May 10 • 385 • 5 ggml-org/InternVL3-8B-Instruct-GGUF 8B • Updated May 10 • 462 • 5 ggml-org/InternVL3-14B-Instruct-GGUF 15B • Updated May 10 • 525 • 4
Qwen 2 VL and Qwen 2.5 VL ggml-org/Qwen2.5-VL-3B-Instruct-GGUF 3B • Updated Apr 30 • 4.35k • 5 ggml-org/Qwen2.5-VL-7B-Instruct-GGUF 8B • Updated Apr 30 • 7.9k • 8 ggml-org/Qwen2.5-VL-32B-Instruct-GGUF 33B • Updated May 15 • 574 • 3 ggml-org/Qwen2-VL-2B-Instruct-GGUF 2B • Updated Apr 30 • 918 • 2
Qwen 3 ggml-org/Qwen3-0.6B-GGUF 0.8B • Updated Apr 28 • 1k • 5 ggml-org/Qwen3-1.7B-GGUF 2B • Updated Apr 28 • 1.34k • 4 ggml-org/Qwen3-4B-GGUF 4B • Updated Apr 28 • 452 • 1 ggml-org/Qwen3-8B-GGUF 8B • Updated Apr 28 • 823 • 3
SmolVLM GGUF ggml-org/SmolVLM2-2.2B-Instruct-GGUF 2B • Updated Apr 30 • 7.24k • 20 ggml-org/SmolVLM2-500M-Video-Instruct-GGUF 0.4B • Updated Apr 30 • 2.16k • 12 ggml-org/SmolVLM2-256M-Video-Instruct-GGUF 0.2B • Updated Apr 30 • 673 • 6 ggml-org/SmolVLM-Instruct-GGUF 2B • Updated Apr 30 • 327 • 6
Gemma 3 ggml-org/gemma-3-270m-it-GGUF 0.3B • Updated Aug 15 • 2.94k • 15 ggml-org/gemma-3-1b-it-GGUF 1.0B • Updated Mar 12 • 11k • 19 ggml-org/gemma-3-4b-it-GGUF Image-Text-to-Text • 4B • Updated May 21 • 25.5k • 40 ggml-org/gemma-3-12b-it-GGUF Image-Text-to-Text • 12B • Updated May 21 • 5.27k • 27
llama.cpp presets Models that are used for presets in llama.cpp. ggml-org/gte-small-Q8_0-GGUF Sentence Similarity • 0.0B • Updated Feb 6 • 115 • 2 ggml-org/bge-small-en-v1.5-Q8_0-GGUF Feature Extraction • 0.0B • Updated Feb 6 • 223 • 3 ggml-org/e5-small-v2-Q8_0-GGUF Sentence Similarity • 0.0B • Updated Feb 6 • 38
GGUF LoRA adapters Adapters extracted from fine tuned models, using mergekit-extract-lora ggml-org/LoRA-Llama-3-Instruct-abliteration-8B-F16-GGUF 0.1B • Updated Nov 1, 2024 • 36 ggml-org/LoRA-Qwen2.5-1.5B-Instruct-abliterated-F16-GGUF 0.1B • Updated Jan 23 • 26 • 2 ggml-org/LoRA-Qwen2.5-3B-Instruct-abliterated-F16-GGUF 0.1B • Updated Jan 9 • 24 • 1 ggml-org/LoRA-Qwen2.5-7B-Instruct-abliterated-v3-F16-GGUF 0.1B • Updated Jan 8 • 29 • 3
llama.vim Recommended models for the llama.vim and llama.vscode plugins ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF Text Generation • 0.5B • Updated Jan 31 • 720 • 6 ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF Text Generation • 2B • Updated Oct 28, 2024 • 6.44k • 9 ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF Text Generation • 3B • Updated Nov 26, 2024 • 1.89k • 5 ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF Text Generation • 8B • Updated Oct 28, 2024 • 2.28k • 5
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF Text Generation • 2B • Updated Oct 28, 2024 • 6.44k • 9
Gemma 1.1 GGUFs ggml-org/gemma-1.1-2b-it-Q8_0-GGUF 3B • Updated Apr 5, 2024 • 1.52k • 1 ggml-org/gemma-1.1-7b-it-Q8_0-GGUF 9B • Updated Apr 5, 2024 • 26 ggml-org/gemma-1.1-7b-it-Q4_K_M-GGUF 9B • Updated Apr 5, 2024 • 1.22k • 4