Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Replicate
Together AI
Cerebras
Nebius AI Studio
SambaNova
Novita
Nscale
fal
Hyperbolic
Cohere
Fireworks
HF Inference API
Misc
Reset Misc
VLM
Inference Endpoints
custom_code
text-generation-inference
Eval Results
Misc with no match
Merge
4-bit precision
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
87
Full-text search
Edit filters
Sort: Trending
Active filters:
VLM
Clear all
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
•
Updated
2 days ago
•
2.63k
•
78
prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Image-Text-to-Text
•
Updated
May 2
•
75.1k
•
80
Efficient-Large-Model/NVILA-Lite-8B-stage2
Text Generation
•
Updated
Jan 6
•
54
•
1
nvidia/Eagle2-2B
Image-Text-to-Text
•
Updated
Apr 27
•
3.32k
•
28
mradermacher/Qwen2-VL-OCR-2B-Instruct-GGUF
Updated
7 days ago
•
508
•
2
prithivMLmods/Callisto-OCR3-2B-Instruct
Image-Text-to-Text
•
Updated
May 2
•
414
•
5
mradermacher/ImageQuality-R1-v1-i1-GGUF
Updated
27 days ago
•
11.8k
•
1
lusxvr/nanoVLM-222M
Image-Text-to-Text
•
Updated
about 1 month ago
•
5.3k
•
83
nvidia/VILA-HD-8B-PS3-1.5K-SigLIP
Image-Text-to-Text
•
Updated
3 days ago
•
45
•
1
nvidia/VILA-HD-8B-PS3-4K-SigLIP
Image-Text-to-Text
•
Updated
3 days ago
•
34
•
1
One-RL-to-See-Them-All/Orsta-7B
Image-Text-to-Text
•
Updated
4 days ago
•
782
•
7
One-RL-to-See-Them-All/Orsta-32B-0326
Image-Text-to-Text
•
Updated
4 days ago
•
159
•
4
Efficient-Large-Model/VILA-13b
Text Generation
•
Updated
Mar 4, 2024
•
56
•
20
Efficient-Large-Model/VILA-7b
Text Generation
•
Updated
Mar 4, 2024
•
142
•
26
Efficient-Large-Model/VILA-7b-4bit-awq
Text Generation
•
Updated
Mar 4, 2024
•
28
•
2
Efficient-Large-Model/VILA-13b-4bit-awq
Text Generation
•
Updated
Mar 4, 2024
•
22
•
2
Efficient-Large-Model/VILA-2.7b
Text Generation
•
Updated
Mar 4, 2024
•
106
•
15
TIGER-Lab/Mantis-bakllava-7b
Image-Text-to-Text
•
Updated
May 18, 2024
•
19
•
5
TIGER-Lab/Mantis-llava-7b
Image-Text-to-Text
•
Updated
May 18, 2024
•
107
•
15
Efficient-Large-Model/VILA1.5-3b
Text Generation
•
Updated
Jul 18, 2024
•
14.9k
•
27
Efficient-Large-Model/VILA1.5-13b
Text Generation
•
Updated
Jul 18, 2024
•
4.51k
•
3
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation
•
Updated
Aug 16, 2024
•
2.15k
•
32
Efficient-Large-Model/VILA1.5-40b
Text Generation
•
Updated
Jul 18, 2024
•
432
•
17
Efficient-Large-Model/VILA1.5-3b-s2
Text Generation
•
Updated
Jul 18, 2024
•
69
•
1
Efficient-Large-Model/VILA1.5-3b-AWQ
Text Generation
•
Updated
Jul 18, 2024
•
42
•
5
Efficient-Large-Model/VILA1.5-3b-s2-AWQ
Text Generation
•
Updated
Jul 18, 2024
•
16
•
1
Efficient-Large-Model/Llama-3-VILA1.5-8b-AWQ
Text Generation
•
Updated
Jul 18, 2024
•
21
•
7
Efficient-Large-Model/VILA1.5-13b-AWQ
Text Generation
•
Updated
Jul 18, 2024
•
45
•
3
Efficient-Large-Model/VILA1.5-40b-AWQ
Text Generation
•
Updated
Jul 18, 2024
•
42
•
3
RussRobin/SpatialBot-3B-LoRA
Visual Question Answering
•
Updated
Sep 5, 2024
•
2
•
3
Previous
1
2
3
Next