Efficient RL Training for LLMs with Experience Replay Paper • 2604.08706 • Published 14 days ago • 18
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published Jan 26 • 42