Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation Paper • 2512.24271 • Published 7 days ago • 46
Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking Paper • 2512.24297 • Published 7 days ago • 5
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement Paper • 2512.21185 • Published 13 days ago • 26
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published 11 days ago • 57
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture Paper • 2512.21675 • Published 12 days ago • 24
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 15 days ago • 62
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published Nov 24, 2025 • 49