Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model about 2 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-quant-test-1 published a model about 2 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-quant-test-1 updated a model about 22 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-quant-test-7-bits-heuristic