StatEval: A Comprehensive Benchmark for Large Language Models in Statistics • Paper • 2510.09517 • Published 6 days ago • 6
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models • Paper • 2509.09675 • Published Sep 11 • 28
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning • Paper • 2509.07980 • Published Sep 9 • 98