Rethinking Video Generation Model for the Embodied World Paper • 2601.15282 • Published about 22 hours ago • 33
Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek Paper • 2601.15100 • Published 1 day ago • 1
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning Paper • 2601.14750 • Published 1 day ago • 13
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models Paper • 2601.10387 • Published 7 days ago • 9
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published 7 days ago • 18
AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems Paper • 2601.11354 • Published 6 days ago • 4
BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search Paper • 2601.11037 • Published 6 days ago • 16
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 8 days ago • 31
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 7 days ago • 30
Inference-time Physics Alignment of Video Generative Models with Latent World Models Paper • 2601.10553 • Published 7 days ago • 11
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 7 days ago • 26
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding Paper • 2601.09575 • Published 8 days ago • 24
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines Paper • 2601.09465 • Published 8 days ago • 39