On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published Feb 24 • 101
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 5 days ago • 104
Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated Dec 23, 2025 • 51
Running on CPU Upgrade Featured 3.07k The Smol Training Playbook 📚 3.07k The secrets to building world-class LLMs
rogue-security/prompt-injection-jailbreak-sentinel-v2 Text Classification • 0.6B • Updated 19 days ago • 28.2k • 28