Yanpeng Zhao

surprisal

AI & ML interests

None yet

Organizations

None yet

submitted a paper to Daily Papers 3 days ago

Probing Visual Planning in Image Editing Models

Paper • 2604.22868 • Published 11 days ago • 1

authored 4 papers about 2 months ago

v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound

Paper • 2509.25773 • Published Sep 30, 2025

Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer

Paper • 2112.08995 • Published Dec 16, 2021

ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language Models

Paper • 2603.13033 • Published Mar 13 • 13

MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning

Paper • 2509.22761 • Published Sep 26, 2025 • 1