Huanyu_Zhang's picture

Huanyu_Zhang

huanyu112

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

GENIUS: Generative Fluid Intelligence Evaluation Suite

upvoted a paper 28 days ago

GEBench: Benchmarking Image Generation Models as GUI Environments

upvoted a paper about 1 month ago

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

View all activity

Organizations

upvoted a paper 26 days ago

GENIUS: Generative Fluid Intelligence Evaluation Suite

Paper • 2602.11144 • Published 26 days ago • 53

upvoted a paper 28 days ago

GEBench: Benchmarking Image Generation Models as GUI Environments

Paper • 2602.09007 • Published 28 days ago • 39

upvoted 2 papers about 1 month ago

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Paper • 2601.21037 • Published Jan 28 • 15

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published Feb 2 • 16

submitted a paper to Daily Papers about 1 month ago

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published Feb 2 • 16

liked a dataset about 1 month ago

VIBE-Benchmark/VIBE-Benchmark

Viewer • Updated Feb 2 • 2.65k • 2.67k • 2

updated 14 datasets about 1 month ago

VIBE-Benchmark/VIBE-Benchmark

Viewer • Updated Feb 2 • 2.65k • 2.67k • 2

VIBE-Benchmark/VIBE-Seedream4.0

Viewer • Updated Feb 1 • 1.03k • 940

VIBE-Benchmark/VIBE-Seedream4.5

Viewer • Updated Feb 1 • 1.03k • 1k

VIBE-Benchmark/OmniGen

Viewer • Updated Feb 1 • 1.03k • 972

VIBE-Benchmark/VIBE-Banana-Flash

Viewer • Updated Feb 1 • 1.01k • 1k

VIBE-Benchmark/VIBE-GPT-Image

Viewer • Updated Feb 1 • 1.01k • 1.03k

VIBE-Benchmark/Edit-R1-Qwen-Image-Edit-2509

Viewer • Updated Feb 1 • 1.03k • 964

VIBE-Benchmark/Qwen-Image-Edit-2509

Viewer • Updated Feb 1 • 1.03k • 999

VIBE-Benchmark/VIBE-Qwen-Image-Edit

Viewer • Updated Feb 1 • 934 • 877

VIBE-Benchmark/FLUX2-dev

Viewer • Updated Feb 1 • 1.03k • 2.39k

VIBE-Benchmark/OmniGen2

Viewer • Updated Feb 1 • 1.03k • 949 • 1

VIBE-Benchmark/UniWorld-V1

Viewer • Updated Feb 1 • 1.03k • 963

VIBE-Benchmark/BAGEL

Viewer • Updated Feb 1 • 1.03k • 2.15k

VIBE-Benchmark/Step1X-Edit-v1p2

Viewer • Updated Feb 1 • 934 • 1.53k • 1