PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 6 days ago • 29
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published 15 days ago • 51
Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents Paper • 2602.16855 • Published 15 days ago • 46
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published 10 days ago • 30
Unified Personalized Reward Model for Vision Generation Paper • 2602.02380 • Published 28 days ago • 20
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published about 1 month ago • 205
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published Jan 14 • 33
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published Jan 8 • 167
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction Paper • 2601.05966 • Published Jan 9 • 23