OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 7 days ago • 69
OmniVoice: High-quality voice cloning TTS for 600+ languages Space • Running on Zero • Agents • Featured • 618
Wan2.2 14B Fast Preview: Generate a video from an image with a text prompt Space • Running on Zero • MCP • 743
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper • 2603.28407 • Published 21 days ago • 69
DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing Paper • 2603.28713 • Published 21 days ago • 20