MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence Paper • 2512.10863 • Published 17 days ago • 21
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing Paper • 2512.10284 • Published 17 days ago • 25
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning Paper • 2511.21688 • Published Nov 26 • 8
Model Extrapolation Expedites Alignment Collection Better aligned models obtained by model extrapolation (ExPO) • 25 items • Updated May 27 • 17