seruva19

AI & ML interests

None yet

Recent Activity

liked a model about 3 hours ago

well9472/Nanosaur

liked a model about 16 hours ago

Kijai/LTXV2_comfy

liked a model 3 days ago

GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn

Organizations

upvoted 2 papers 9 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 10 days ago • 232

Pretraining Frame Preservation in Autoregressive Video Memory Compression

Paper • 2512.23851 • Published 12 days ago • 22

upvoted 2 papers 23 days ago

IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning

Paper • 2512.15635 • Published 24 days ago • 19

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 26 days ago • 73

upvoted an article 25 days ago

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

Article • 26 days ago

upvoted 2 papers about 1 month ago

Adversarial Flow Models

Paper • 2511.22475 • Published Nov 27, 2025 • 22

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 72

upvoted a paper about 2 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 227

upvoted a collection 2 months ago

VisionLM

Collection

1867 items • Updated 19 days ago • 140

upvoted a paper 2 months ago

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25, 2025 • 29

upvoted 2 papers 3 months ago

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Paper • 2510.20822 • Published Oct 23, 2025 • 40

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17, 2025 • 50

upvoted a paper 4 months ago

Mixture of Contexts for Long Video Generation

Paper • 2508.21058 • Published Aug 28, 2025 • 35

upvoted a paper 6 months ago

∇NABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17, 2025 • 124

upvoted a collection 8 months ago

Alchemist

Collection

📊 Dataset and 🏆 checkpoints for paper 📝 "Alchemist: Turning Public Text-to-Image Data into Generative Gold" • 8 items • Updated Oct 16, 2025 • 17

upvoted a paper 8 months ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26, 2025 • 56

upvoted 2 papers about 1 year ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 63

Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Paper • 2412.01819 • Published Dec 2, 2024 • 34

upvoted 2 papers over 1 year ago

CogVLM2: Visual Language Models for Image and Video Understanding

Paper • 2408.16500 • Published Aug 29, 2024 • 57

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50
