Junjie Fei's picture

Junjie Fei

FeiElysia

AI & ML interests

None yet

Recent Activity

liked a Space about 6 hours ago

Vision-CAIR/Tempo

upvoted a paper about 22 hours ago

Neural Computers

upvoted a paper about 23 hours ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

View all activity

Organizations

None yet

liked a Space about 6 hours ago

Tempo

Smart Compressors for Long Video Understanding

upvoted a paper about 22 hours ago

Neural Computers

Paper • 2604.06425 • Published 6 days ago • 23

upvoted a paper about 23 hours ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published 4 days ago • 15

authored 5 papers 3 days ago

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

Paper • 2305.02677 • Published May 4, 2023 • 1

Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents

Paper • 2411.16740 • Published Nov 23, 2024 • 2

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Paper • 2503.19065 • Published Mar 24, 2025 • 11

Neural Computers

Paper • 2604.06425 • Published 6 days ago • 23

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published 4 days ago • 15

submitted a paper to Daily Papers 3 days ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published 4 days ago • 15

authored a paper over 1 year ago

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

Paper • 2307.16525 • Published Jul 31, 2023 • 1