Beyond Monolithic Rewards: A Hybrid and Multi-Aspect Reward Optimization for MLLM Alignment Paper • 2510.05283 • Published 18 days ago