OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation Paper • 2512.08294 • Published 21 days ago • 17
OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published 27 days ago • 32
SCA: Improve Semantic Consistent in Unrestricted Adversarial Attacks via DDPM Inversion Paper • 2410.02240 • Published Oct 3, 2024 • 1
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views Paper • 2510.18632 • Published Oct 21 • 21
Detail++: Training-Free Detail Enhancer for Text-to-Image Diffusion Models Paper • 2507.17853 • Published Jul 23 • 1
X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning Paper • 2508.07607 • Published Aug 11 • 1
Training-Free Watermarking for Autoregressive Image Generation Paper • 2505.14673 • Published May 20 • 12