Running 3.6k The Ultra-Scale Playbook π 3.6k The ultimate guide to training LLM on large GPU Clusters
mattshumer/Reflection-Llama-3.1-70B Text Generation β’ 71B β’ Updated Sep 24, 2024 β’ 457 β’ 1.71k
MaziyarPanahi/MixTAO-7Bx2-MoE-Instruct-v7.0-GGUF Text Generation β’ 13B β’ Updated Feb 4, 2024 β’ 179 β’ 9