Running on CPU Upgrade 208 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 208 Explore synthetic data experiments as an interactive bookshelf
view article Article The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ Feb 3 • 52
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 30 days ago • 95
Running on Zero MCP Featured 33 NeuTTS-Nano Multilingual Collection 🌍 33 Generate speech with voice cloning, now in four languages!
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 67
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published Sep 9, 2025 • 105
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification Paper • 2505.16938 • Published May 22, 2025 • 121
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published Dec 23, 2025 • 41
A Cartography of Open Collaboration in Open Source AI: Mapping Practices, Motivations, and Governance in 14 Open Large Language Model Projects Paper • 2509.25397 • Published Sep 29, 2025 • 14
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 29 items • Updated 3 days ago • 136
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 14 days ago • 53k • 513