Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
z-lab 's Collections
DFlash
ParoQuant

ParoQuant

updated about 12 hours ago

Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Upvote
4

  • ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

    Paper • 2511.10645 • Published Nov 13, 2025 • 6

  • z-lab/Qwen3.5-4B-PARO

    1B • Updated about 23 hours ago • 103 • 1

  • z-lab/Qwen3.5-0.8B-PARO

    Image-Text-to-Text • 0.4B • Updated about 16 hours ago • 65

  • z-lab/Qwen3.5-2B-PARO

    1B • Updated about 23 hours ago • 4

  • z-lab/Qwen3.5-9B-PARO

    3B • Updated about 23 hours ago • 4

  • z-lab/Qwen3-8B-PARO

    Text Generation • 1B • Updated about 15 hours ago • 990

  • z-lab/Qwen3-4B-PARO

    Text Generation • 0.9B • Updated about 15 hours ago • 242

  • z-lab/Qwen3-0.6B-PARO

    0.2B • Updated about 16 hours ago • 61

  • z-lab/Qwen3-1.7B-PARO

    Text Generation • 0.5B • Updated about 15 hours ago

  • z-lab/Qwen3-14B-PARO

    Text Generation • 2B • Updated about 15 hours ago • 4

  • z-lab/Qwen3-4B-Thinking-2507-PARO

    1B • Updated Oct 31, 2025 • 7

  • z-lab/Llama-3.1-8B-Instruct-PARO

    Text Generation • 1B • Updated about 14 hours ago

  • z-lab/Meta-Llama-3-8B-PARO

    1B • Updated Oct 29, 2025 • 3

  • z-lab/Llama-2-7b-hf-PARO

    Text Generation • 1B • Updated about 14 hours ago

  • z-lab/DeepSeek-R1-Distill-Llama-8B-PARO

    1B • Updated Oct 29, 2025 • 1

  • z-lab/paroquant-checkpoints

    Updated about 17 hours ago
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs