13 10

Kane Chen

KaneC

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

V-Reflection: Transforming MLLMs from Passive Observers to Active Interrogators

liked a model 8 days ago

garlandchou/V-Reflection

liked a Space about 1 month ago

haodongli/DVD

View all activity

Organizations

None yet

upvoted a paper 8 days ago

V-Reflection: Transforming MLLMs from Passive Observers to Active Interrogators

Paper • 2604.03307 • Published 15 days ago • 14

liked a model 8 days ago

garlandchou/V-Reflection

Visual Question Answering • 921k • Updated 8 days ago • 56 • 5

liked a Space about 1 month ago

DVD

🦀

Official demo of DVD (https://dvd-project.github.io/)

liked a model about 1 month ago

FayeHongfeiZhang/DVD

2B • Updated 8 days ago • 13

upvoted a paper 2 months ago

Show, Don't Tell: Morphing Latent Reasoning into Image Generation

Paper • 2602.02227 • Published Feb 2 • 10

upvoted 3 papers 4 months ago

A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Paper • 2512.14442 • Published Dec 16, 2025 • 11

UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving

Paper • 2512.09864 • Published Dec 10, 2025 • 12

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Paper • 2511.23127 • Published Nov 28, 2025 • 44

upvoted a paper 5 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 44

upvoted a paper 6 months ago

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

Paper • 2510.09507 • Published Oct 10, 2025 • 11

liked a dataset 6 months ago

zhangzixin02/PhysToolBench

Viewer • Updated Oct 14, 2025 • 1.01k • 33 • 7

upvoted 3 papers 7 months ago

liked a model 8 months ago

allenai/MolmoAct-7B-D-0812

Robotics • 8B • Updated Oct 24, 2025 • 570 • 53

upvoted a paper 8 months ago

MolmoAct: Action Reasoning Models that can Reason in Space

Paper • 2508.07917 • Published Aug 11, 2025 • 45

upvoted a paper 11 months ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20, 2025 • 134

liked a Space 12 months ago

DiMeR Demo

🐨

Generate 3D models from text and images

upvoted a paper 12 months ago

DiMeR: Disentangled Mesh Reconstruction Model

Paper • 2504.17670 • Published Apr 24, 2025 • 24

liked a Space about 1 year ago

Rembg

👀

215

Remove backgrounds from images instantly