view article Article Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs lujangusface โข Apr 3 โข 8
view article Article 2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5 lujangusface โข Apr 9 โข 3
cyankiwi/Qwen3-30B-A3B-Instruct-2507-AWQ-4bit Text Generation โข 5B โข Updated 5 days ago โข 55k โข 31
cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-8bit Text Generation โข 84B โข Updated 5 days ago โข 53 โข 5