D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use Paper • 2602.02160 • Published 11 days ago • 14 • 9
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use Paper • 2602.02160 • Published 11 days ago • 14 • 9