SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds Paper • 2512.01078 • Published Nov 30, 2025 • 34
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 87
AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies Paper • 2508.08113 • Published Aug 11, 2025 • 11
From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens Paper • 2510.02292 • Published Oct 2, 2025 • 1
Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry Paper • 2510.25595 • Published Oct 29, 2025
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation Paper • 2511.01163 • Published Nov 3, 2025 • 32
Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation Paper • 2506.21876 • Published Jun 27, 2025 • 28
4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time Paper • 2506.18890 • Published Jun 23, 2025 • 6