Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning Paper • 2510.14095 • Published 6 days ago • 5
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation Paper • 2510.15624 • Published 4 days ago • 12
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning Paper • 2509.02479 • Published Sep 2 • 83
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning Paper • 2507.22607 • Published Jul 30 • 46
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs Paper • 2506.14429 • Published Jun 17 • 44
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders Paper • 2506.14002 • Published Jun 16 • 5
Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published Mar 26 • 56
Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework Paper • 2503.10704 • Published Mar 12 • 5