FineWeb: decanting the web for the finest text data at scale 🍷 Generate a curated web‑text dataset for LLM training • 1.3k
The Smol Training Playbook 📚 The secrets to building world-class LLMs • 3.02k
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 7 days ago • 98
Trinity Collection Collection of Arcee AI models in the Trinity family • 8 items • Updated Dec 11, 2025 • 26
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 522
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 5 items • Updated Jan 27 • 172