L2P: Unlocking Latent Potential for Pixel Generation Paper ⢠2605.12013 ⢠Published 8 days ago ⢠29
UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset Paper ⢠2510.20661 ⢠Published Oct 23, 2025 ⢠16
Subject-Consistent and Pose-Diverse Text-to-Image Generation Paper ⢠2507.08396 ⢠Published Jul 11, 2025 ⢠16
MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs Paper ⢠2506.01674 ⢠Published Jun 2, 2025 ⢠28
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper ⢠2503.23461 ⢠Published Mar 30, 2025 ⢠94
Cosmos Collection ā ļø This collection is archived. š https://huggingface.co/collections/nvidia/nvidia-cosmos-2 ⢠14 items ⢠Updated 11 days ago ⢠302
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper ⢠2501.02976 ⢠Published Jan 6, 2025 ⢠56
StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors Paper ⢠2412.11586 ⢠Published Dec 16, 2024 ⢠11
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Paper ⢠2412.09283 ⢠Published Dec 12, 2024 ⢠19
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Paper ⢠2411.06558 ⢠Published Nov 10, 2024 ⢠36
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation Paper ⢠2407.02371 ⢠Published Jul 2, 2024 ⢠55
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network Paper ⢠2406.18284 ⢠Published Jun 26, 2024 ⢠19