Skywork-Unipic3 Collection Unified Multi-Image Composition with Sequence Modeling • 4 items • Updated 9 days ago • 6
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published 7 days ago • 32
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement Paper • 2512.21185 • Published 29 days ago • 30
DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space Paper • 2509.25180 • Published Sep 29, 2025 • 9
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 16 days ago • 132
SpotEdit: Selective Region Editing in Diffusion Transformers Paper • 2512.22323 • Published 27 days ago • 39
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published 27 days ago • 60
TwinFlow Collection A collection of TwinFlow-accelerated diffusion models • 4 items • Updated 24 days ago • 6
Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield Paper • 2511.22677 • Published Nov 27, 2025 • 31
CASA Collection CASA: Cross-Attention as Self-Attention for Efficient Vision-Language Fusion on long-context streaming inputs • 6 items • Updated about 1 month ago • 7
3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework Paper • 2512.17459 • Published Dec 19, 2025 • 12
MeshSplatting: Differentiable Rendering with Opaque Meshes Paper • 2512.06818 • Published Dec 7, 2025 • 11
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties Paper • 2512.11799 • Published Dec 12, 2025 • 30
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction Paper • 2511.23386 • Published Nov 28, 2025 • 16