iFSQ: Improving FSQ for Image Generation with 1 Line of Code • Paper • 2601.17124 • Published 8 days ago • 30
Rethinking Video Generation Model for the Embodied World • Paper • 2601.15282 • Published 10 days ago • 42
CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance • Paper • 2503.10391 • Published Mar 13, 2025 • 12
Focal Guidance: Unlocking Controllability from Semantic-Weak Layers in Video Diffusion Models • Paper • 2601.07287 • Published 19 days ago • 5
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head • Paper • 2601.07832 • Published 19 days ago • 51