The Ultra-Scale Playbook 🌌 • Running • 3.83k • The ultimate guide to training LLM on large GPU Clusters
Qwen/Qwen3-Next-80B-A3B-Instruct Text Generation • 81B • Updated Sep 17, 2025 • 265k • 1.01k
meituan-longcat/LongCat-Flash-Chat Text Generation • 562B • Updated Sep 24, 2025 • 59.1k • 533
CohereLabs/command-a-reasoning-08-2025 Text Generation • 111B • Updated Jan 13 • 897 • 141