PLUME: Latent Reasoning Based Universal Multimodal Embedding Paper • 2604.02073 • Published 7 days ago • 10
Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing Paper • 2604.02288 • Published 7 days ago • 25