Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models Paper • 2603.15557 • Published 6 days ago • 28
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper • 2503.10460 • Published Mar 13, 2025 • 30
yentinglin/Mistral-Small-24B-Instruct-2501-reasoning Text Generation • 24B • Updated Apr 20, 2025 • 45 • • 59