MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding Paper • 2501.18362 • Published Jan 30 • 23
Article Nemotron-Personas-Japan: Synthesized Data for Sovereign AI By nvidia and 6 others • 23 days ago • 25
weblab-llm-competition-2025-bridge/difficult_problem_dataset_v4_500 Viewer • Updated 28 days ago • 5.05k • 46