Yuxin Chen's picture

7

Yuxin Chen

Uasonchen

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

OmniScript: Towards Audio-Visual Script Generation for Long-Form Cinematic Video

updated a collection about 1 month ago

Video Understanding

authored a paper 4 months ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

View all activity

Organizations

upvoted a paper 1 day ago

OmniScript: Towards Audio-Visual Script Generation for Long-Form Cinematic Video

Paper • 2604.11102 • Published 10 days ago • 6

updated a collection about 1 month ago

Video Understanding

2 items • Updated Mar 20

authored a paper 4 months ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 51

upvoted 2 papers 4 months ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 51

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Paper • 2512.16915 • Published Dec 18, 2025 • 38

updated 2 collections 6 months ago

Video Generation

9 items • Updated Oct 10, 2025 • 1

MLLM

5 items • Updated Oct 10, 2025

upvoted a paper 7 months ago

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23, 2025 • 30

authored 2 papers 7 months ago

EA-VTR: Event-Aware Video-Text Retrieval

Paper • 2407.07478 • Published Jul 10, 2024 • 1

Taming Rectified Flow for Inversion and Editing

Paper • 2411.04746 • Published Nov 7, 2024