Scaling RL to Long Videos
Efficient-Large-Model
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
-
413
SanaSprint
πUltra fast high quality image generation
-
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Paper β’ 2503.09641 β’ Published β’ 40 -
Efficient-Large-Model/Sana_Sprint_1.6B_1024px
Text-to-Image β’ Updated β’ 48 β’ 15 -
Efficient-Large-Model/Sana_Sprint_0.6B_1024px
Text-to-Image β’ Updated β’ 44 β’ 5
A series of VILA models that specialize for **long-context** abilities
-
Efficient-Large-Model/NVILA-15B
Text Generation β’ Updated β’ 12k β’ 22 -
Efficient-Large-Model/NVILA-Lite-15B
Text Generation β’ Updated β’ 5.85k β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B
Text Generation β’ Updated β’ 12.4k β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B-stage2
Text Generation β’ Updated β’ 9 β’ 1
SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
-
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Paper β’ 2501.18427 β’ Published β’ 21 -
Efficient-Large-Model/SANA1.5_4.8B_1024px
Text-to-Image β’ Updated β’ 171 β’ β’ 23 -
Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers
Text-to-Image β’ Updated β’ β’ 14 -
Efficient-Large-Model/SANA1.5_1.6B_1024px
Text-to-Image β’ Updated β’ 819 β’ β’ 1
β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
-
Efficient-Large-Model/Sana_1600M_1024px
Text-to-Image β’ Updated β’ 367 β’ β’ 213 -
Efficient-Large-Model/Sana_1600M_1024px_BF16
Text-to-Image β’ Updated β’ 84 β’ 13 -
Efficient-Large-Model/Sana_1600M_1024px_BF16_ControlNet_HED
Text-to-Image β’ Updated β’ 40 -
Efficient-Large-Model/Sana_600M_1024px_ControlNet_HED
Text-to-Image β’ Updated β’ 83
-
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation β’ Updated β’ 1.5k β’ 37 -
Efficient-Large-Model/VILA1.5-40b
Text Generation β’ Updated β’ 3.85k β’ 17 -
Efficient-Large-Model/VILA1.5-3b
Text Generation β’ Updated β’ 14.9k β’ 30 -
Efficient-Large-Model/VILA1.5-3b-AWQ
Text Generation β’ Updated β’ 49 β’ 5
Scaling RL to Long Videos
-
Efficient-Large-Model/NVILA-15B
Text Generation β’ Updated β’ 12k β’ 22 -
Efficient-Large-Model/NVILA-Lite-15B
Text Generation β’ Updated β’ 5.85k β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B
Text Generation β’ Updated β’ 12.4k β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B-stage2
Text Generation β’ Updated β’ 9 β’ 1
SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
-
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Paper β’ 2501.18427 β’ Published β’ 21 -
Efficient-Large-Model/SANA1.5_4.8B_1024px
Text-to-Image β’ Updated β’ 171 β’ β’ 23 -
Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers
Text-to-Image β’ Updated β’ β’ 14 -
Efficient-Large-Model/SANA1.5_1.6B_1024px
Text-to-Image β’ Updated β’ 819 β’ β’ 1
πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
-
413
SanaSprint
πUltra fast high quality image generation
-
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Paper β’ 2503.09641 β’ Published β’ 40 -
Efficient-Large-Model/Sana_Sprint_1.6B_1024px
Text-to-Image β’ Updated β’ 48 β’ 15 -
Efficient-Large-Model/Sana_Sprint_0.6B_1024px
Text-to-Image β’ Updated β’ 44 β’ 5
β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
-
Efficient-Large-Model/Sana_1600M_1024px
Text-to-Image β’ Updated β’ 367 β’ β’ 213 -
Efficient-Large-Model/Sana_1600M_1024px_BF16
Text-to-Image β’ Updated β’ 84 β’ 13 -
Efficient-Large-Model/Sana_1600M_1024px_BF16_ControlNet_HED
Text-to-Image β’ Updated β’ 40 -
Efficient-Large-Model/Sana_600M_1024px_ControlNet_HED
Text-to-Image β’ Updated β’ 83
A series of VILA models that specialize for **long-context** abilities
-
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation β’ Updated β’ 1.5k β’ 37 -
Efficient-Large-Model/VILA1.5-40b
Text Generation β’ Updated β’ 3.85k β’ 17 -
Efficient-Large-Model/VILA1.5-3b
Text Generation β’ Updated β’ 14.9k β’ 30 -
Efficient-Large-Model/VILA1.5-3b-AWQ
Text Generation β’ Updated β’ 49 β’ 5