Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
krishnateja95
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model about 12 hours ago: nm-testing/Llama-3.1-8B-Instruct-KV-Cache-FP8
updated a model about 12 hours ago: RedHatAI/Qwen3-8B-FP8-block
updated a model 1 day ago: nm-testing/granite-4.0-h-small-FP8-block