nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation • 2B • Updated 17 days ago • 11.3k • 182
Reward Bench 2 Collection Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated Jun 3 • 14
SynLogic Collection Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond • 5 items • Updated Jun 3 • 11
Holo1 Collection Vision-Language Action Model for use in Surfer-H web navigation agent • 6 items • Updated Jun 10 • 48
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 1 day ago • 71
RoboBrain2.0 Collection RoboBrain 2.0: See Better. Think Harder. Do Smarter. • 6 items • Updated 17 days ago • 16