Papers
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper
•
2510.13786
•
Published
•
31
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper
•
2510.14973
•
Published
•
40
Paper
•
2510.13998
•
Published
•
57
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper
•
2510.19430
•
Published
•
50
Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values
Paper
•
2510.20187
•
Published
•
18
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper
•
2510.19363
•
Published
•
61
Qwen3-Omni Technical Report
Paper
•
2509.17765
•
Published
•
145
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground
Paper
•
2512.10430
•
Published
•
113
Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Paper
•
2512.14067
•
Published
•
13
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper
•
2512.17351
•
Published
•
25
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper
•
2512.16676
•
Published
•
207
Reinforcement Learning for Self-Improving Agent with Skill Library
Paper
•
2512.17102
•
Published
•
32
mHC: Manifold-Constrained Hyper-Connections
Paper
•
2512.24880
•
Published
•
257
TransMLA: Multi-head Latent Attention Is All You Need
Paper
•
2502.07864
•
Published
•
57