Collection: Multimodal VLMs - Until July'25
Multimodal VLMs for Domain-Specific Tasks: OCR, Reasoning, and Captioning • 12 items • Updated 29 days ago • 3
prithivMLmods/Qwen2.5-VL-3B-Abliterated-Caption-it • Image-Text-to-Text • 4B • Updated Aug 16 • 178 • 4