Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Joya Chen PRO
chenjoya
AI & ML interests
Video LLM
Recent Activity
published
a dataset
5 days ago
chenjoya/spc_demo_videos
updated
a dataset
5 days ago
chenjoya/spc_demo_videos
upvoted
a
paper
12 days ago
Robix: A Unified Model for Robot Interaction, Reasoning and Planning