One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
authored
a paper
7 days ago
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?
liked
a dataset
7 days ago
kolerk/Video_Reality_Test