
ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4
Text Generation
•
0.8B
•
Updated
•
17
High-quality QAT FP4 models to use with the fp_quant vLLM/Transformers integration on Blackwell NVIDIA GPUs. See https://arxiv.org/abs/2509.23202