Running 3.68k The Ultra-Scale Playbook π 3.68k The ultimate guide to training LLM on large GPU Clusters
Running 65 UncheatableEval π 65 Explore model compression ratios with interactive tables and plots