DreamFoley: Scalable VLMs for High-Fidelity Video-to-Audio Generation Paper • 2512.06022 • Published 25 days ago • 3
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models Paper • 2504.15279 • Published Apr 21 • 78