Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers • Paper • 2602.03510 • Published 3 days ago • 24
Context Forcing: Consistent Autoregressive Video Generation with Long Context • Paper • 2602.06028 • Published about 23 hours ago • 22
RISE-Video: Can Video Generators Decode Implicit World Rules? • Paper • 2602.05986 • Published about 24 hours ago • 22
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR • Paper • 2602.05261 • Published 1 day ago • 43
SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs • Paper • 2602.06040 • Published about 23 hours ago • 9
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better • Paper • 2602.05393 • Published 1 day ago • 3