DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published 8 days ago • 183
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published 11 days ago • 63
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published 11 days ago • 63
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published 11 days ago • 63
EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models Paper • 2512.14666 • Published 10 days ago • 8
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation Paper • 2408.12528 • Published Aug 22, 2024 • 51
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction Paper • 2503.15661 • Published Mar 19 • 2
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 17 days ago • 46
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published 24 days ago • 63
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation Paper • 2511.20256 • Published Nov 25 • 27
Computer-Use Agents as Judges for Generative User Interface Paper • 2511.15567 • Published Nov 19 • 52
Computer-Use Agents as Judges for Generative User Interface Paper • 2511.15567 • Published Nov 19 • 52
Computer-Use Agents as Judges for Generative User Interface Paper • 2511.15567 • Published Nov 19 • 52 • 2
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation Paper • 2511.11434 • Published Nov 14 • 44