Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yanpeng Zhao's picture
1 2

Yanpeng Zhao

surprisal
·

AI & ML interests

None yet

Recent Activity

submitted a paper 3 days ago
Probing Visual Planning in Image Editing Models
authored a paper about 2 months ago
v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound
authored a paper about 2 months ago
Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
View all activity

Organizations

None yet

submitted a paper to Daily Papers 3 days ago

Probing Visual Planning in Image Editing Models

Paper • 2604.22868 • Published 11 days ago • 1
authored 4 papers about 2 months ago

v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound

Paper • 2509.25773 • Published Sep 30, 2025

Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer

Paper • 2112.08995 • Published Dec 16, 2021

ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language Models

Paper • 2603.13033 • Published Mar 13 • 13

MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning

Paper • 2509.22761 • Published Sep 26, 2025 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs