Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Model Tree
Reset
nvidia/Llama-3.1-Minitron-4B-Width-Base
Adapters
Finetunes
Quantizations
Merges
Inference Providers
Select all
Nscale
Novita
SambaNova
Together AI
fal
Fireworks
Hyperbolic
Cerebras
Nebius AI Studio
Replicate
Cohere
HF Inference API
Misc
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
16
Full-text search
Edit filters
Sort: Trending
Active filters:
nvidia/Llama-3.1-Minitron-4B-Width-Base
Clear all
NikolayKozloff/Llama-3.1-Minitron-4B-Width-Base-Q8_0-GGUF
Updated
Aug 16, 2024
•
5
•
8
bartowski/Llama-3.1-Minitron-4B-Width-Base-GGUF
Text Generation
•
Updated
Aug 27, 2024
•
313
legraphista/Llama-3.1-Minitron-4B-Width-Base-GGUF
Text Generation
•
Updated
Aug 17, 2024
•
602
•
13
mradermacher/Llama-3.1-Minitron-4B-Width-Base-GGUF
Updated
Feb 14
•
93
QuantFactory/magnum-v2-4b-GGUF
Text Generation
•
Updated
Aug 25, 2024
•
114
•
1
Theta-Lev/Llama-3.1-Minitron-4B-Width-Base-Q8_0-GGUF
Updated
Aug 28, 2024
•
2
altomek/Llama-3.1-Minitron-4B-Width-Base-8bpw-EXL2
Updated
Aug 30, 2024
•
3
altomek/Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_4-GGUF
Updated
Aug 30, 2024
•
8
ijohn07/Llama-3.1-Minitron-4B-Width-Base-Q5_K_M-GGUF
Updated
Sep 4, 2024
•
5
Solshine/Llama-3.1-Minitron-4B-Width-Base-Q4_K_M-GGUF
Updated
Sep 12, 2024
•
3
•
1
finnwengWCH/Llama-3.1-Minitron-4B-Width-Base-Q4_K_M-GGUF
Updated
Sep 14, 2024
•
1
Gint0ki/Llama-3.1-Minitron-4B-Width-Base-Q8_0-GGUF
Updated
Sep 15, 2024
•
2
QuantFactory/MagpieLM-4B-SFT-v0.1-GGUF
Updated
Sep 20, 2024
•
3
•
2
psx7/llama4B
Text Generation
•
Updated
Oct 2, 2024
•
631
PrunaAI/nvidia-Llama-3.1-Minitron-4B-Width-Base-GGUF-smashed
Updated
Feb 27
•
4
Mungert/Llama-3.1-Nemotron-Nano-4B-v1.1-GGUF
Text Generation
•
Updated
about 12 hours ago
•
1.73k