Submitted by Vasily 104 When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA AIRI - Artificial Intelligence Research Institute 8 3
Submitted by dongguanting 93 Agentic Entropy-Balanced Policy Optimization Renmin University of China 703 4
Submitted by taesiri 73 WithAnyone: Towards Controllable and ID Consistent Image Generation StepFun 276 3
Submitted by zichenwen 66 AI for Service: Proactive Assistance with AI Glasses Shanghai Jiao Tong University 2
Submitted by Paranioar 60 From Pixels to Words -- Towards Native Vision-Language Primitives at Scale SenseTime 159 2
Submitted by xiaochonglinghu 50 ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints AMAP-ML 52 2
Submitted by taesiri 43 PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model PaddlePaddle 59.5k 5
Submitted by Keven16 37 LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Tencent Hunyuan 15 2
Submitted by mukul54 34 Attention Is All You Need for KV Cache in Diffusion LLMs Mohamed Bin Zayed University of Artificial Intelligence 2
Submitted by KID-22 30 Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Ant Group 9 2
Submitted by pengyunie 28 TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar University of Waterloo 5 2
Submitted by taesiri 21 MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning · 14 authors 16 2
Submitted by jyhong836 18 LLMs Can Get "Brain Rot"! Visual Informatics Group @ University of Texas at Austin 11 2
Submitted by CheeryLJH 17 VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning NJU-LINK Lab 16 2
Submitted by kenchan0226 16 Large Language Models Do NOT Really Know What They Don't Know Singapore Management University 2
Submitted by han1997 12 VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation Westlake University 2
Submitted by bclavie 12 Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Mixedbread 2
Submitted by XINLI1997 12 COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes Multimodal Art Projection 0 2
Submitted by XINLI1997 10 Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures ByteDance Seed 0 2
Submitted by wimmerth 10 AnyUp: Universal Feature Upsampling Max Planck Institute for Informatics 250 2
Submitted by MilaWang 9 LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild · 10 authors 2
Submitted by shenweijie 8 Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning · 13 authors 2
Submitted by Lakonik 7 pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation Adobe 81 2
Submitted by JonasGeiping 6 Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models ELLIS Institute Tübingen 835 2
Submitted by jiwonsong 6 LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning Seoul National University 1 2
Submitted by stefan-it 6 The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models CORAL NLP Research 3 2
Submitted by HJGO 6 VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator · 6 authors 28 2
Submitted by DaYin 5 LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training UCLA NLP 2
Submitted by hk 5 DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation UCLA NLP 3 2
Submitted by awni00 5 Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning · 4 authors 1 2
Submitted by qiranzou 5 FML-bench: A Benchmark for Automatic ML Research Agents Highlighting the Importance of Exploration Breadth National University of Singapore 4 2
Submitted by kylemontgomery 4 Budget-aware Test-time Scaling via Discriminative Verification · 7 authors 2 2
Submitted by shaoweiliu 3 Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation Snapchat Inc. 18 2
Submitted by kylemontgomery 3 Predicting Task Performance with Context-aware Scaling Laws · 7 authors 1 2
Submitted by Robot2050 2 MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems · 6 authors 2
Submitted by SP2001 2 Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms · 7 authors 2
Submitted by kedaxiaoqiu 2 SCas4D: Structural Cascaded Optimization for Boosting Persistent 4D Novel View Synthesis University of Illinois at Urbana-Champaign 2
Submitted by ZYao720 1 GroundedPRM: Tree-Guided and Fidelity-Aware Process Reward Modeling for Step-Level Reasoning Ludwig Maximilian University of Munich 2
Submitted by augustus2011 1 Beyond One World: Benchmarking Super Heros in Role-Playing Across Multiversal Contexts Character-lab 1 2
Submitted by zhangchen1991 1 RAGCap-Bench: Benchmarking Capabilities of LLMs in Agentic Retrieval Augmented Generation Systems National University of Singapore 2
Submitted by aashiqmuhamed 1 RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models Amazon AGI 2
Submitted by NickNickGo - Mirror Speculative Decoding: Breaking the Serial Barrier in LLM Inference Apple 2