14 5

ScottZhang

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models

upvoted a paper 21 days ago

Video-CoE: Reinforcing Video Event Prediction via Chain of Events

upvoted a paper 29 days ago

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

View all activity

Organizations

upvoted a paper 16 days ago

Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models

Paper • 2603.22212 • Published 17 days ago • 125

upvoted a paper 21 days ago

Video-CoE: Reinforcing Video Event Prediction via Chain of Events

Paper • 2603.14935 • Published 24 days ago • 91

upvoted a paper 29 days ago

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published Mar 3 • 145

updated a dataset about 1 month ago

GD-ML/MobilityBench

Viewer • Updated Mar 5 • 50k • 145 • 16

upvoted 2 papers about 1 month ago

From Scale to Speed: Adaptive Test-Time Scaling for Image Editing

Paper • 2603.00141 • Published Feb 24 • 138

MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios

Paper • 2602.22638 • Published Feb 26 • 107

liked 4 datasets about 1 month ago

published a dataset about 1 month ago

GD-ML/MobilityBench

Viewer • Updated Mar 5 • 50k • 145 • 16

liked a dataset about 2 months ago

GD-ML/IntTravel_dataset

Preview • Updated Feb 26 • 3.11k • 90

upvoted a paper about 2 months ago

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 202

upvoted 3 papers 2 months ago

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 153

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Paper • 2601.20354 • Published Jan 28 • 112

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Paper • 2601.20614 • Published Jan 28 • 120

upvoted 3 papers 3 months ago

Urban Socio-Semantic Segmentation with Vision-Language Reasoning

Paper • 2601.10477 • Published Jan 15 • 156

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published Jan 8 • 169

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published Dec 30, 2025 • 64

upvoted a paper 8 months ago

Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation

Paper • 2508.07981 • Published Aug 11, 2025 • 63

ScottZhang

AI & ML interests

Recent Activity

Organizations

ScottZhang's activity