Diffusion CoT

non-profit

AI & ML interests

diffusion

Recent Activity

metazlb authored a paper 5 days ago

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

sayakpaul authored a paper about 1 month ago

Fine-Grained Perturbation Guidance via Attention Head Selection

sayakpaul authored a paper about 1 month ago

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

View all activity

authored a paper 5 days ago

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Paper • 2603.03646 • Published Mar 4 • 8

authored 2 papers about 1 month ago

Fine-Grained Perturbation Guidance via Attention Head Selection

Paper • 2506.10978 • Published Jun 12, 2025 • 25

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

Paper • 2602.21778 • Published Feb 25 • 14

authored 3 papers about 2 months ago

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Paper • 2503.19065 • Published Mar 24, 2025 • 11

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Paper • 2504.16080 • Published Apr 22, 2025 • 15

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

Paper • 2602.21778 • Published Feb 25 • 14

submitted a paper to Daily Papers about 2 months ago

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

Paper • 2602.21778 • Published Feb 25 • 14

published a dataset about 2 months ago

metazlb/PhysicTran38K

Updated Mar 7 • 68.8k • 9

updated a dataset about 2 months ago

metazlb/PhysicTran38K

Updated Mar 7 • 68.8k • 9

updated a dataset about 2 months ago

metazlb/PhysicTran38K

Updated Mar 7 • 68.8k • 9

authored a paper about 2 months ago

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

Paper • 2602.15449 • Published Feb 17 • 7

authored 4 papers 5 months ago

Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision

Paper • 2504.04903 • Published Apr 7, 2025

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Paper • 2510.05091 • Published Oct 6, 2025 • 20

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7, 2025 • 55

PICABench: How Far Are We from Physically Realistic Image Editing?

Paper • 2510.17681 • Published Oct 20, 2025 • 65

authored a paper 6 months ago

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Paper • 2510.05091 • Published Oct 6, 2025 • 20

authored 3 papers 7 months ago

Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model

Paper • 2509.04548 • Published Sep 4, 2025 • 6

RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10, 2025 • 73

Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance

Paper • 2508.21016 • Published Aug 28, 2025

published a dataset 7 months ago

diffusion-cot/echo-4o-instruction-following

Viewer • Updated Aug 19, 2025 • 68k • 174