Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published May 5 • 79
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29 • 44
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29 • 44
stable-diffusion-v1-5/stable-diffusion-inpainting Text-to-Image • Updated Sep 6, 2024 • 648k • 71
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18 • 39
diffusers/stable-diffusion-xl-1.0-inpainting-0.1 Text-to-Image • Updated Sep 3, 2023 • 686k • 340
stabilityai/stable-diffusion-xl-refiner-1.0 Image-to-Image • Updated Sep 25, 2023 • 483k • 1.94k