vision - a Eun02 Collection

Eun02 's Collections

agent

dataset

LLM

vision

video

vision

updated Oct 27, 2025

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29, 2025 • 136
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8, 2025 • 58
DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 266
Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation

Paper • 2508.18032 • Published Aug 25, 2025 • 42
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Paper • 2410.10629 • Published Oct 14, 2024 • 12
Masked Autoencoders Are Effective Tokenizers for Diffusion Models

Paper • 2502.03444 • Published Feb 5, 2025
Seedream 3.0 Technical Report

Paper • 2504.11346 • Published Apr 15, 2025 • 70
DanceGRPO: Unleashing GRPO on Visual Generation

Paper • 2505.07818 • Published May 12, 2025 • 32
UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward

Paper • 2509.06818 • Published Sep 8, 2025 • 29
Instruct-Imagen: Image Generation with Multi-modal Instruction

Paper • 2401.01952 • Published Jan 3, 2024 • 32
Kontinuous Kontext: Continuous Strength Control for Instruction-based Image Editing

Paper • 2510.08532 • Published Oct 9, 2025 • 5
Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 165