Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model about 2 hours ago: inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid
published a model about 2 hours ago: inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid
updated a collection about 6 hours ago: Qwen3.6-HIGGS