Hugging Face
Models: 4,810
Sort: Trending
Active filters: image-text-to-text
nintwentydo/Pixtral-Large-Instruct-2411-exl2-8.0bpw • Image-Text-to-Text • Updated Dec 21, 2024 • 2
prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct • Image-Text-to-Text • 2B • Updated May 2 • 645 • 6
prithivMLmods/Qwen2-VL-Math-Prase-2B-Instruct • Image-Text-to-Text • 2B • Updated May 2 • 14 • 4
OpenGVLab/HoVLE • Image-Text-to-Text • 3B • Updated Dec 24, 2024 • 19 • 13
nvidia/NVLM-D-72B-mcore • Image-Text-to-Text • Updated Jan 14 • 6
OpenGVLab/HoVLE-HD • Image-Text-to-Text • 3B • Updated Feb 9 • 17 • 8
mjschock/Qwen2-VL-2B-Instruct-Q4_K_M-GGUF • Image-Text-to-Text • 2B • Updated Dec 19, 2024 • 7 • 1
THUDM/cogagent-9b-20241220 • Image-Text-to-Text • 14B • Updated Dec 25, 2024 • 244 • 53
taroshi/InternVL2_5-4B • Image-Text-to-Text • 4B • Updated Dec 22, 2024 • 5
city96/llava-llama-3-8b-v1_1-imat-gguf • Image-Text-to-Text • 8B • Updated Dec 20, 2024 • 1.47k • 29
X-iZhang/libra-v1.0-7b • Image-Text-to-Text • 7B • Updated 6 days ago • 270 • 2
OpenGVLab/InternVL2_5-78B-MPO • Image-Text-to-Text • 78B • Updated Mar 25 • 242 • 54
OpenGVLab/InternVL2_5-38B-MPO • Image-Text-to-Text • 38B • Updated Mar 25 • 15.1k • 20
OpenGVLab/InternVL2_5-26B-MPO • Image-Text-to-Text • 26B • Updated Mar 25 • 499 • 14
OpenGVLab/InternVL2_5-8B-MPO • Image-Text-to-Text • 8B • Updated Mar 25 • 9.6k • 48
OpenGVLab/InternVL2_5-4B-MPO • Image-Text-to-Text • 4B • Updated Mar 25 • 8.21k • 18
OpenGVLab/InternVL2_5-2B-MPO • Image-Text-to-Text • 2B • Updated Mar 25 • 890 • 12
OpenGVLab/InternVL2_5-1B-MPO • Image-Text-to-Text • 0.9B • Updated Mar 25 • 605 • 24
mlx-community/deepseek-vl2-small-4bit • Image-Text-to-Text • 3B • Updated Dec 22, 2024 • 45
mlx-community/deepseek-vl2-4bit • Image-Text-to-Text • 4B • Updated Dec 22, 2024 • 53 • 1
prince-canuma/deepseek-vl2-small • Image-Text-to-Text • 16B • Updated Dec 22, 2024 • 2
prince-canuma/deepseek-vl2 • Image-Text-to-Text • 27B • Updated Dec 22, 2024 • 3
prince-canuma/deepseek-vl2-tiny • Image-Text-to-Text • 3B • Updated Dec 22, 2024 • 3
mlx-community/deepseek-vl2-small-6bit • Image-Text-to-Text • 4B • Updated Dec 22, 2024 • 18
mlx-community/deepseek-vl2-6bit • Image-Text-to-Text • 6B • Updated Dec 22, 2024 • 28 • 1
mlx-community/deepseek-vl2-small-8bit • Image-Text-to-Text • 5B • Updated Dec 22, 2024 • 25
mlx-community/deepseek-vl2-small-3bit • Image-Text-to-Text • 2B • Updated Dec 22, 2024 • 13
mlx-community/deepseek-vl2-8bit • Image-Text-to-Text • 8B • Updated Jan 2 • 198 • 5
mlx-community/deepseek-vl2-small-bf16 • Image-Text-to-Text • 16B • Updated Dec 22, 2024 • 13
mlx-community/deepseek-vl2-tiny-bf16 • Image-Text-to-Text • 3B • Updated Dec 22, 2024 • 19