PLUME: Latent Reasoning Based Universal Multimodal Embedding Paper • 2604.02073 • Published 6 days ago • 10
CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models Paper • 2604.04780 • Published 2 days ago • 7
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence Paper • 2512.04563 • Published Dec 4, 2025 • 16
UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval Paper • 2508.04136 • Published Aug 6, 2025
Referring Expression Instance Retrieval and A Strong End-to-End Baseline Paper • 2506.18246 • Published Jun 23, 2025