Inference Optimization

Recent Activity

ChibuUkachi  updated a model 2 days ago
inference-optimization/Ministral-3-14B-Instruct-2512-FP8
ChibuUkachi  published a model 2 days ago
inference-optimization/Ministral-3-14B-Instruct-2512-FP8
mgoin  updated a model 3 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

Members: Michael Goin, Eldar Kurtić, Fynn Schmitt-Ulms, Alexandre Marques, Dipika, Krishna Teja Chitty-Venkata, Chibueze Ukachi, Linghao Kong, Rahul Tuli, Kyle Sayers, Neural Magic Research, Megan Flynn, Brian Dellabetta, Helen Zhao

inference-optimization's models (38)

inference-optimization/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

33B • Updated Dec 4, 2025

inference-optimization/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Head

33B • Updated Dec 4, 2025

inference-optimization/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Tensor

71B • Updated Dec 4, 2025

inference-optimization/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Head

71B • Updated Dec 4, 2025

inference-optimization/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

71B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head

71B • Updated Dec 4, 2025

inference-optimization/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor

8B • Updated Dec 4, 2025

inference-optimization/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

8B • Updated Dec 4, 2025