MiDashengLM: Efficient Audio Understanding with General Audio Captions Paper • 2508.03983 • Published 3 days ago • 5
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper • 2508.04280 • Published 2 days ago • 32
Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning Paper • 2508.03501 • Published 3 days ago • 35
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 7 days ago • 172
Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models Paper • 2508.02120 • Published 5 days ago • 5
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Paper • 2508.05635 • Published 1 day ago • 53
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published 15 days ago • 64
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published 3 days ago • 27
CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction Paper • 2508.03159 • Published 3 days ago • 20
AgroBench: Vision-Language Model Benchmark in Agriculture Paper • 2507.20519 • Published 12 days ago • 5
Phi-Ground Tech Report: Advancing Perception in GUI Grounding Paper • 2507.23779 • Published 8 days ago • 40
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models Paper • 2507.23682 • Published 8 days ago • 22
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning Paper • 2507.14111 • Published 21 days ago • 22
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels Paper • 2507.21809 • Published 10 days ago • 117
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published 11 days ago • 75
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm Paper • 2507.18553 • Published 15 days ago • 37