The Majority is not always right: RL training for solution aggregation Paper • 2509.06870 • Published Sep 8 • 16