Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Paper • 2505.15778 • Published May 21 • 18
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Paper • 2509.19736 • Published 25 days ago • 11