Detecting and Preventing Hallucinations in Large Vision Language Models Paper • 2308.06394 • Published Aug 11, 2023
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains Paper • 2507.17746 • Published Jul 23, 2025 • 3