HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video • Paper • 2510.05560 • Published 23 days ago • 7
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning • Paper • 2510.06217 • Published 23 days ago • 62
Less is More: Recursive Reasoning with Tiny Networks • Paper • 2510.04871 • Published 24 days ago • 456
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation • Paper • 2510.08673 • Published 21 days ago • 120
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI • Paper • 2510.05684 • Published 23 days ago • 133
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives • Paper • 2510.20822 • Published 7 days ago • 37
DeepAgent: A General Reasoning Agent with Scalable Toolsets • Paper • 2510.21618 • Published 6 days ago • 84
Video-As-Prompt: Unified Semantic Control for Video Generation • Paper • 2510.20888 • Published 7 days ago • 41
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation • Paper • 2510.21583 • Published 6 days ago • 30
RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging • Paper • 2510.20479 • Published 7 days ago • 11