A fine-grained visual reasoning benchmark (We extend with more question type in V2 dataset)
Sicheng Feng
FSCCS

·
AI & ML interests
None yet
Recent Activity
liked
a model
about 23 hours ago
Qwen/Qwen2.5-Math-7B-Instruct
liked
a dataset
19 days ago
simplescaling/s1K
authored
a paper
about 1 month ago
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token
Compression across Images, Videos, and Audios