LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 9 days ago • 69
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 15 days ago • 125
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published Oct 20 • 62
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published Oct 20 • 62
Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance Paper • 2510.24711 • Published Oct 28 • 19
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13 • 176
Training-Free Efficient Video Generation via Dynamic Token Carving Paper • 2505.16864 • Published May 22 • 24
Training-Free Efficient Video Generation via Dynamic Token Carving Paper • 2505.16864 • Published May 22 • 24
Improved Diffusion-based Image Colorization via Piggybacked Models Paper • 2304.11105 • Published Apr 21, 2023
Video Colorization with Pre-trained Text-to-Image Diffusion Models Paper • 2306.01732 • Published Jun 2, 2023
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation Paper • 2307.06940 • Published Jul 13, 2023 • 10
Running Featured 364 Qwen2.5 Omni 7B Demo 🏆 364 Generate text and speech responses from text, audio, images, or video input
STEVE: AStep Verification Pipeline for Computer-use Agent Training Paper • 2503.12532 • Published Mar 16 • 17