FP-Quant QAT

updated 1 day ago

High-quality FP4 models produced with quantization-aware training (QAT), for use with the fp_quant vLLM/Transformers integration on NVIDIA Blackwell GPUs. See https://arxiv.org/abs/2509.23202
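
A minimal sketch of loading one of these checkpoints through Transformers is given here. It assumes, without confirmation from this page, that the installed Transformers version ships the fp_quant integration with its kernel dependencies, that the quantization settings are read from the checkpoint's own config, and that a Blackwell GPU is visible as a CUDA device. The helper names `load_fp4_model` and `generate` are hypothetical and exist only for this example.

```python
# Repository ID copied from this collection; any other entry can be substituted.
MODEL_ID = "ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4"


def load_fp4_model(model_id: str = MODEL_ID):
    """Load a pre-quantized checkpoint and its tokenizer (hypothetical helper).

    Assumption: the fp_quant integration and a supported GPU are available,
    and the quantization settings are picked up from the checkpoint config.
    """
    # Imported lazily so this module can be imported without the heavy dependency.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # No explicit quantization config is passed here (assumption: not needed
    # for checkpoints that were already quantized before upload).
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="cuda")
    return model, tokenizer


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation for `prompt` with the loaded model."""
    model, tokenizer = load_fp4_model()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Hello")` downloads the checkpoint on first use. On hardware or software that lacks FP4 support, loading may fail or fall back to a slower path, depending on the installed versions.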


  • ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4

    Text Generation • 0.8B • Updated 1 day ago • 17

  • ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-MXFP4

    Text Generation • 0.8B • Updated 1 day ago • 14

  • ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-NVFP4

    Text Generation • 2B • Updated 1 day ago • 21

  • ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-MXFP4

    Text Generation • 2B • Updated 1 day ago • 15

  • ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-NVFP4

    Text Generation • 5B • Updated 1 day ago • 31

  • ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-MXFP4

    Text Generation • 5B • Updated 1 day ago • 23

  • ISTA-DASLab/Qwen3-0.6B-FPQuant-QAT-NVFP4

    Text Generation • 0.4B • Updated 1 day ago • 21

  • ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4

    Text Generation • 1B • Updated 1 day ago • 12

  • ISTA-DASLab/Qwen3-4B-FPQuant-QAT-NVFP4

    Text Generation • 2B • Updated 1 day ago • 16

  • ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4

    Text Generation • 5B • Updated 1 day ago • 21

  • ISTA-DASLab/Qwen3-8B-FPQuant-QAT-MXFP4

    Text Generation • 5B • Updated 1 day ago • 43