Models (259)

Active filters: VLM

nvidia/NVIDIA-Nemotron-Parse-v1.2

Image-Text-to-Text • 0.9B • Updated 8 days ago • 10.2k • 30

nvidia/Eagle2.5-8B

Image-Text-to-Text • 8B • Updated Nov 29, 2025 • 62.1k • 38

nvidia/Eagle2-1B

Image-Text-to-Text • 1B • Updated Apr 27, 2025 • 214 • 29

xlangai/OpenCUA-32B

Image-Text-to-Text • 33B • Updated Jan 24 • 754 • 28

mradermacher/GUI-Libra-8B-i1-GGUF

8B • Updated 5 days ago • 1.56k • 1

pfnet/plamo-2.1-2b-vl

Image-Text-to-Text • 3B • Updated 14 days ago • 324 • 4

Efficient-Large-Model/VILA-13b

Text Generation • 13B • Updated Mar 4, 2024 • 16 • 20

Efficient-Large-Model/VILA-7b

Text Generation • 7B • Updated Mar 4, 2024 • 60 • 27

Efficient-Large-Model/VILA-7b-4bit-awq

Text Generation • Updated Mar 4, 2024 • 13 • 2

Efficient-Large-Model/VILA-13b-4bit-awq

Text Generation • Updated Mar 4, 2024 • 7 • 2

Efficient-Large-Model/VILA-2.7b

Text Generation • 3B • Updated Mar 4, 2024 • 95 • 15

TIGER-Lab/Mantis-bakllava-7b

Image-Text-to-Text • 8B • Updated May 18, 2024 • 17 • 5

TIGER-Lab/Mantis-llava-7b

Image-Text-to-Text • 7B • Updated May 18, 2024 • 48 • 16

Efficient-Large-Model/VILA1.5-3b

Text Generation • Updated Jul 18, 2024 • 1.24k • 34

Efficient-Large-Model/VILA1.5-13b

Text Generation • Updated Jul 18, 2024 • 1.19k • 5

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16, 2024 • 429 • 37

Efficient-Large-Model/VILA1.5-40b

Text Generation • Updated Jul 18, 2024 • 110 • 17

Efficient-Large-Model/VILA1.5-3b-s2

Text Generation • Updated Jul 18, 2024 • 15 • 2

Efficient-Large-Model/VILA1.5-3b-AWQ

Text Generation • Updated Jul 18, 2024 • 29 • 7

Efficient-Large-Model/VILA1.5-3b-s2-AWQ

Text Generation • Updated Jul 18, 2024 • 11 • 2

Efficient-Large-Model/Llama-3-VILA1.5-8b-AWQ

Text Generation • Updated Jul 18, 2024 • 19 • 7

Efficient-Large-Model/VILA1.5-13b-AWQ

Text Generation • Updated Jul 18, 2024 • 8 • 3

Efficient-Large-Model/VILA1.5-40b-AWQ

Text Generation • Updated Jul 18, 2024 • 9 • 3

RussRobin/SpatialBot-3B-LoRA

Visual Question Answering • Updated Sep 5, 2024 • 3

RussRobin/SpatialBot-3B

Visual Question Answering • 3B • Updated Sep 10, 2024 • 163 • 19

aimagelab/LLaVA_MORE-llama_3_1-8B-finetuning

Image-Text-to-Text • 8B • Updated Aug 2, 2025 • 1.02k • 11

Ligeng-Zhu/VILA15_3b

Text Generation • Updated Aug 7, 2024 • 2

NVEagle/Eagle-X5-13B-Chat

Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 15 • 28

NVEagle/Eagle-X5-13B

Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 25 • 15

NVEagle/Eagle-X5-7B

Image-Text-to-Text • 9B • Updated Sep 16, 2024 • 35 • 26