Running • 2.99k • The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLM on large GPU Clusters
unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit Text Generation • 5B • Updated 22 days ago • 34.2k • 31