Collaborative Transformers for Grounded Situation Recognition Paper • 2203.16518 • Published Mar 30, 2022
PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization Paper • 2307.15199 • Published Jul 27, 2023 • 13
Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild Paper • 2403.14539 • Published Mar 21, 2024
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers Paper • 2207.13820 • Published Jul 27, 2022
Wan2.2 14B Preview Space • Running on Zero • MCP • 1.91k • Generate a video from an image with a text prompt
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published Aug 11, 2025 • 76
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published Nov 13, 2025 • 101
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6, 2025 • 242