18 42 9

Weihao Yu

whyu

https://scholar.google.com/citations?user=LYxjt1QAAAAJ

AI & ML interests

Computer Vision, NLP and AI

Recent Activity

upvoted a paper about 16 hours ago

Make Geometry Matter for Spatial Reasoning

upvoted a paper 15 days ago

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

upvoted a paper about 1 month ago

NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

View all activity

Organizations

upvoted a paper about 16 hours ago

Make Geometry Matter for Spatial Reasoning

Paper • 2603.26639 • Published 4 days ago • 21

upvoted a paper 15 days ago

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Paper • 2603.15478 • Published 15 days ago • 24

upvoted a paper about 1 month ago

NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

Paper • 2602.22144 • Published Feb 25 • 1

submitted a paper to Daily Papers about 1 month ago

NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

Paper • 2602.22144 • Published Feb 25 • 1

upvoted a paper about 2 months ago

dVoting: Fast Voting for dLLMs

Paper • 2602.12153 • Published Feb 12 • 21

upvoted 3 papers 3 months ago

upvoted 4 papers 4 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 156

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published Nov 24, 2025 • 32

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 194

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published Nov 14, 2025 • 47

upvoted a paper 5 months ago

Visual Spatial Tuning

Paper • 2511.05491 • Published Nov 7, 2025 • 53

liked a model 5 months ago

optimum-intel-internal-testing/tiny-random-PoolFormerModel

Updated Oct 21, 2025 • 8.74k • 1

upvoted 4 papers 5 months ago

Parallel Loop Transformer for Efficient Test-Time Computation Scaling

Paper • 2510.24824 • Published Oct 28, 2025 • 17

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Paper • 2510.22946 • Published Oct 27, 2025 • 18

Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets

Paper • 2510.19944 • Published Oct 22, 2025 • 22

upvoted 2 papers 6 months ago

Trace Anything: Representing Any Video in 4D via Trajectory Fields

Paper • 2510.13802 • Published Oct 15, 2025 • 31

Generative Universal Verifier as Multimodal Meta-Reasoner

Paper • 2510.13804 • Published Oct 15, 2025 • 27

Weihao Yu

AI & ML interests

Recent Activity

Organizations

whyu's activity