yxsllgz-uts-org/Math_Consistency-Probability-Llama-3.2-3B-Instruct-style1 Viewer • Updated Dec 18, 2024 • 6.82k • 6 • 1
Tool-R0 Collection Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data (https://arxiv.org/pdf/2602.21320) • 5 items • Updated Mar 3 • 2
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Paper • 2505.20411 • Published May 26, 2025 • 96
TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments Paper • 2510.01179 • Published Oct 1, 2025 • 28