G-Zero: Self-Play for Open-Ended Generation from Zero Data Paper • 2605.09959 • Published 1 day ago • 10
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 4 days ago • 57
Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration Paper • 2605.05566 • Published 5 days ago • 35