UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist • Paper • 2511.08521 • Published 5 days ago • 34
Black-Box On-Policy Distillation of Large Language Models • Paper • 2511.10643 • Published 3 days ago • 35
Depth Anything 3: Recovering the Visual Space from Any Views • Paper • 2511.10647 • Published 3 days ago • 35
Music Flamingo: Scaling Music Understanding in Audio Language Models • Paper • 2511.10289 • Published 3 days ago • 8