Jackrong/MLX-Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-8bit Text Generation • 27B • Updated Mar 21 • 4.64k • 13
inferencerlabs/NVIDIA-Nemotron-3-Super-120B-A12B-MLX-4.5bit Text Generation • 121B • Updated Mar 14 • 2.79k • 7
ParoQuant Collection Pairwise Rotation Quantization for Efficient Reasoning LLM Inference • 20 items • Updated about 7 hours ago • 21