The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics 8 days ago • 21
Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline 11 days ago • 36
Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation 12 days ago • 13
Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds 13 days ago • 5
NVIDIA Nemotron 2 Nano 9B Japanese: State-of-the-Art Small Language Model Customized for Japanese Sovereign AI Feb 17 • 3
Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models Jan 6 • 26
The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator Dec 17, 2025 • 47
Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models Dec 15, 2025 • 110
How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare Oct 28, 2025 • 20
NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks Oct 28, 2025 • 17
Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI Oct 28, 2025 • 21
Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes Oct 22, 2025 • 11
Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard Oct 21, 2025 • 14
Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models Oct 20, 2025 • 20
📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models Aug 18, 2025 • 5
NVIDIA Releases Improved Pretraining Dataset: Preserves High Value Math & Code, and Augments with Multi-Lingual Aug 18, 2025 • 4
NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks Aug 11, 2025 • 76
Llama-NeMoRetriever-ColEmbed: Developer-Focused Guide to NVIDIA's State-of-the-Art Text-Image Retrieval Jul 9, 2025 • 4
Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions Jun 10, 2025 • 25
view article Article Nemotron 3 Content Safety 4B: Multimodal, Multilingual Content Moderation 4 days ago • 1
view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding 5 days ago • 42
view article Article Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI 7 days ago • 52
view article Article The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics 8 days ago • 21
view article Article Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline 11 days ago • 36
view article Article Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation 12 days ago • 13
view article Article Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds 13 days ago • 5
view article Article From Scarcity to Scale: How Synthetic Personas Can Bootstrap Japanese AI Development Feb 19 • 2
view article Article NVIDIA Nemotron 2 Nano 9B Japanese: State-of-the-Art Small Language Model Customized for Japanese Sovereign AI Feb 17 • 3
view article Article Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model Feb 4 • 28