Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published 4 days ago • 30
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published 16 days ago • 68
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 7 days ago • 175
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward Paper • 2508.03686 • Published 4 days ago • 30
CellForge: Agentic Design of Virtual Cell Models Paper • 2508.02276 • Published 5 days ago • 36
Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models Paper • 2508.00819 • Published 8 days ago • 58
On the Expressiveness of Softmax Attention: A Recurrent Neural Network Perspective Paper • 2507.23632 • Published 9 days ago • 6
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Paper • 2507.22827 • Published 10 days ago • 88
Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention Paper • 2507.17745 • Published 17 days ago • 30
MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper • 2507.14958 • Published 20 days ago • 45
Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning Paper • 2507.16784 • Published 18 days ago • 115
Gaussian Splatting with Discretized SDF for Relightable Assets Paper • 2507.15629 • Published 19 days ago • 22
Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling Paper • 2507.11061 • Published 25 days ago • 37
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization Paper • 2507.15061 • Published 20 days ago • 48
Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers Paper • 2507.08422 • Published 29 days ago • 35
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities Paper • 2507.13158 • Published 23 days ago • 24
FLEXITOKENS: Flexible Tokenization for Evolving Language Models Paper • 2507.12720 • Published 23 days ago • 8