Datasets with reasoning traces for math and code (Train + Eval)
Maojia Song
OrangeEye
·
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
From Perception to Action: An Interactive Benchmark for Vision Reasoning upvoted a paper 2 months ago
Evaluating Gemini Robotics Policies in a Veo World Simulator