
neuralmagic/Llama-3.2-3B-Instruct-quantized.w8a8
Text Generation
•
4B
•
Updated
•
379
LLMs, optimization, compression, sparsification, quantization, pruning, distillation, NLP, CV