A series of large multimodal models (LMMs) fine-tuned on the Inst-IT Dataset, skilled in fine-grained, instance-level image and video understanding.

Inst-IT
university
AI & ML interests: Large Multimodal Models