Qwen/Qwen3-VL-30B-A3B-Instruct Image-Text-to-Text β’ 31B β’ Updated Nov 26, 2025 β’ 888k β’ β’ 514
Running Featured 558 Vision Arena (Testing VLMs side-by-side) πΌ 558 Display image analysis results
Running on CPU Upgrade Featured 2.9k The Smol Training Playbook π 2.9k The secrets to building world-class LLMs
yayayaaa/florence-2-large-ft-moredetailed Image-to-Text β’ 0.8B β’ Updated Dec 13, 2025 β’ 87 β’ 15
meta-llama/Llama-3.2-11B-Vision Image-Text-to-Text β’ 11B β’ Updated Sep 27, 2024 β’ 9.86k β’ 578
Runtime error Featured 515 Florence2 + SAM2 π₯ 515 Segment and caption objects in images and videos
Running on Zero Featured 5.03k FLUX.1 [Schnell] π 5.03k Generate unique images from text descriptions