Unveiling Simplicities of Attention: Adaptive Long-Context Head Identification Paper • 2502.09647 • Published Feb 11, 2025
Provable Benefits of In-Tool Learning for Large Language Models Paper • 2508.20755 • Published Aug 28, 2025 • 11
Efficient RL Training for LLMs with Experience Replay Paper • 2604.08706 • Published 12 days ago • 17