INT4 LLMs for vLLM Collection Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! • 18 items • Updated Sep 26, 2024 • 11
meta-llama/Meta-Llama-3-8B-Instruct Text Generation • 8B • Updated Jun 18, 2025 • 1.46M • 4.38k
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-FP16-BinaryClass-WeightedLoss Token Classification • 0.3B • Updated Jun 1, 2024
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-BinaryClass-WeightedLoss Token Classification • 0.3B • Updated Jun 1, 2024
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune Token Classification • 0.3B • Updated Jun 1, 2024