FG-CLIP 2

qihoo360's Collections

FG-CLIP 2 is a foundation model for fine-grained vision-language understanding in both English and Chinese.