view article Article Distribution Matching Prevents Mode Collapse in Training Reasoning Models about 23 hours ago • 1