Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation Paper • 2510.04373 • Published 17 days ago
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 17 days ago • 104
Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use Paper • 2509.12867 • Published Sep 16
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 22 days ago • 5