Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception Paper • 2510.12720 • Published Oct 14, 2025
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO Paper • 2505.17017 • Published May 22, 2025
EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning Paper • 2505.04623 • Published May 7, 2025