peiran wu's picture

3 3 2

peiran wu

peiranW

·

WPR001

AI & ML interests

Video Understanding

Recent Activity

authored a paper 2 days ago

MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding

upvoted a paper 2 days ago

MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding

updated a collection 14 days ago

View all activity

Organizations

authored a paper 2 days ago

MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding

Paper • 2510.07915 • Published 11 days ago • 1

upvoted a paper 2 days ago

MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding

Paper • 2510.07915 • Published 11 days ago • 1

updated a collection 14 days ago

UGC-VideoCap

3 items • Updated 14 days ago

updated a model 14 days ago

Memories-ai/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated 14 days ago • 322

published a model 14 days ago

Memories-ai/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated 14 days ago • 322

updated a collection 14 days ago

UGC-VideoCap

3 items • Updated 14 days ago

updated a dataset 14 days ago

Memories-ai/UGC-VideoCap

Updated 14 days ago • 30

published a dataset 14 days ago

Memories-ai/UGC-VideoCap

Updated 14 days ago • 30

updated 2 collections 14 days ago

UGC-VideoCap

3 items • Updated 14 days ago

ST-Think

1 item • Updated 14 days ago

updated a dataset 2 months ago

openinterx/UGC-VideoCap

Updated Aug 20 • 124

updated a model 3 months ago

openinterx/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Jul 19 • 144 • 1

New activity in openinterx/UGC-VideoCap 3 months ago

Improve dataset card: Add metadata, abstract, links, and usage details

#1 opened 3 months ago by

New activity in openinterx/UGC-VideoCaptioner 3 months ago

Improve model card: Add metadata tags, abstract, links, usage, and benchmarks

#1 opened 3 months ago by

commented a paper 3 months ago

UGC-VideoCaptioner: An Omni UGC Video Detail Caption Model and New Benchmarks

Paper • 2507.11336 • Published Jul 15 • 5 •

authored 2 papers 3 months ago

ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos

Paper • 2503.12542 • Published Mar 16 • 1

UGC-VideoCaptioner: An Omni UGC Video Detail Caption Model and New Benchmarks

Paper • 2507.11336 • Published Jul 15 • 5

updated a collection 3 months ago

UGC-VideoCap

3 items • Updated Jul 16