AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents Paper • 2603.18429 • Published Mar 19 • 26
CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage Paper • 2605.15597 • Published 5 days ago • 8
PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era Paper • 2509.12989 • Published Sep 16, 2025 • 28
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks Paper • 2510.25760 • Published Oct 29, 2025 • 17
Temporal Gains, Spatial Costs: Revisiting Video Fine-Tuning in Multimodal Large Language Models Paper • 2603.17541 • Published Mar 18 • 20