LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 19 days ago • 134
Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals Paper • 2601.05848 • Published 16 days ago • 16
VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator Paper • 2510.13454 • Published Oct 15, 2025 • 10
Aligning Text to Image in Diffusion Models is Easier Than You Think Paper • 2503.08250 • Published Mar 11, 2025 • 2
Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising Paper • 2511.08633 • Published Nov 9, 2025 • 55
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper • 2507.08800 • Published Jul 11, 2025 • 81
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition Paper • 2506.17201 • Published Jun 20, 2025 • 57
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers Paper • 2506.05573 • Published Jun 5, 2025 • 82
FreeTimeGS: Free Gaussians at Anytime and Anywhere for Dynamic Scene Reconstruction Paper • 2506.05348 • Published Jun 5, 2025 • 6
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation Paper • 2505.24521 • Published May 30, 2025 • 15
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment Paper • 2505.18600 • Published May 24, 2025 • 48
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis Paper • 2505.09358 • Published May 14, 2025 • 26
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Paper • 2504.07961 • Published Apr 10, 2025 • 5