SyGra: The One-Stop Framework for Building Data for LLMs and SLMs (Sep 22, 2025)
TRL v1.0: Post-Training Library Built to Move with the Field (18 days ago)
Multimodal Embedding & Reranker Models with Sentence Transformers (9 days ago)
Efficient LLM Pretraining: Packed Sequences and Masked Attention (Oct 7, 2024)
The Smol Training Playbook 📚: The secrets to building world-class LLMs
The Ultra-Scale Playbook 🌌: The ultimate guide to training LLMs on large GPU clusters