EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper • 2602.18071 • Published 7 days ago • 21
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published 7 days ago • 30
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published 18 days ago • 211
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper • 2512.03041 • Published Dec 2, 2025 • 66
GENIUS: Generative Fluid Intelligence Evaluation Suite Paper • 2602.11144 • Published 16 days ago • 53
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 22 days ago • 339
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper • 2602.06949 • Published 21 days ago • 35
Omni Image Editor 🖼 • Running on CPU Upgrade • 1.07k • Image edit, text to image, image upscale, remove watermark
Qwen-Image-Edit-2511-LoRAs-Fast 🎃 • Running on Zero • MCP • Featured • 925 • Demo of the Collection of Qwen Image Edit LoRAs
Qwen3-TTS Demo 🎙 • Running on Zero • Featured • 1.57k • Generate custom speech from text, voice descriptions, or samples
Z Image Turbo 🖼 • Running on Zero • MCP • 2.38k • Generate high-quality images from text prompts in seconds
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published 27 days ago • 305