[mixed] Chess x AI Collection Research directly related to Chess technology. • 3 items • Updated 1 day ago • 1
Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning Paper • 2601.03872 • Published 18 days ago • 42
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Paper • 2601.04767 • Published 17 days ago • 28
OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent Paper • 2601.07779 • Published 13 days ago • 27
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking Paper • 2601.06487 • Published 15 days ago • 50
GLEE: A Unified Framework and Benchmark for Language-based Economic Environments Paper • 2410.05254 • Published Oct 7, 2024 • 85
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published Dec 23, 2025 • 83
Confidence Estimation for LLMs in Multi-turn Interactions Paper • 2601.02179 • Published 20 days ago • 16
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents Paper • 2601.12294 • Published 7 days ago • 16
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling Paper • 2512.23959 • Published 27 days ago • 109
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 19 days ago • 134
A BERTology View of LLM Orchestrations: Token- and Layer-Selective Probes for Efficient Single-Pass Classification Paper • 2601.13288 • Published 6 days ago • 12
SciCoQA: Quality Assurance for Scientific Paper--Code Alignment Paper • 2601.12910 • Published 6 days ago • 3
Video-As-Prompt: Unified Semantic Control for Video Generation Paper • 2510.20888 • Published Oct 23, 2025 • 49
Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek Paper • 2601.15100 • Published 4 days ago • 3