Submitted by Nothing2Say 20 PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning · 6 authors 2
Submitted by CSJianYang 12 T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables · 15 authors 2
Submitted by sahsaeedi 8 How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench · 8 authors 2
Submitted by blaz-r 4 No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes · 3 authors 78 2
Submitted by Omartificial-Intelligence-Space 4 UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat · 1 authors 2
Submitted by RSW233 3 From reactive to cognitive: brain-inspired spatial intelligence for embodied agents · 7 authors 19 2
Submitted by Soontosh 2 Democracy-in-Silico: Institutional Design as Alignment in AI-Governed Polities · 2 authors 2