Cong Wei PRO

CongWei1230

https://congwei1230.github.io/

AI & ML interests

Generative Models; Reasoning

Recent Activity

upvoted a paper 7 days ago

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

Paper • 2512.03041 • Published 8 days ago • 61

upvoted 2 papers 8 days ago

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published 15 days ago • 45

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 9 days ago • 60

upvoted a paper about 1 month ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published Oct 24 • 21

upvoted a paper about 2 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17 • 48

commented on a paper about 2 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9 • 70

upvoted a paper about 2 months ago

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12 • 27

authored a paper 2 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9 • 70

upvoted 2 papers 2 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9 • 70

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 266

upvoted 2 papers 3 months ago

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

Paper • 2509.01644 • Published Sep 1 • 33

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 75

liked a dataset 5 months ago

APRIL-AIGC/UltraVideo-Long

Viewer • Updated Jul 14 • 16.6k • 434 • 5

upvoted a paper 5 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

liked a dataset 6 months ago

LanguageBind/UniWorld-V1

Viewer • Updated Jun 16 • 7.11k • 4.17k • 20
