R^3L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification Paper • 2601.03715 • Published 13 days ago • 1
HiconAgent: History Context-aware Policy Optimization for GUI Agents Paper • 2512.01763 • Published Dec 1, 2025 • 6