Datasets for SketchVLM: Vision-Language Models Can Annotate Images to Explain Thoughts and Guide Users
(https://sketchvlm.github.io/)
- loganbolton/sketchvlm-physics-ball-drop
- loganbolton/sketchvlm-maze-navigation
- SketchVLM: Vision language models can annotate images to explain thoughts and guide users (Paper • 2604.22875)
- loganbolton/sketchvlm-connect-dots