MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published 9 days ago • 179
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems Paper • 2602.08847 • Published Feb 9 • 28
AgentOCR: Reimagining Agent History via Optical Self-Compression Paper • 2601.04786 • Published Jan 8 • 30
GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies Paper • 2512.02581 • Published Dec 2, 2025 • 15