Hugging Face
Models: 4,952
Sort: Trending
Active filters: image-text-to-text
Skywork/Skywork-VL-Reward-7B • Image-Text-to-Text • 8B • Updated Jun 10 • 195 • 43
moondream/moondream-2b-2025-04-14 • Image-Text-to-Text • 2B • Updated May 21 • 58 • 6
google/gemma-3-27b-it-qat-q4_0-unquantized • Image-Text-to-Text • 27B • Updated Apr 15 • 7.46k • 34
mlx-community/gemma-3-4b-it-qat-bf16 • Image-Text-to-Text • 5B • Updated Apr 18 • 38 • 1
mlx-community/gemma-3-27b-it-qat-4bit • Image-Text-to-Text • Updated Apr 19 • 163k • 19
mlx-community/gemma-3-27b-it-qat-bf16 • Image-Text-to-Text • Updated Apr 18 • 1.06k • 5
OpenGVLab/InternVL3-2B-Instruct • Image-Text-to-Text • 2B • Updated May 29 • 51.2k • 5
OpenGVLab/InternVL3-8B-Instruct • Image-Text-to-Text • 8B • Updated May 29 • 15.6k • 10
OpenGVLab/InternVL3-1B-Pretrained • Image-Text-to-Text • 0.9B • Updated Apr 25 • 1.94k • 3
OpenGVLab/InternVL3-38B-AWQ • Image-Text-to-Text • Updated May 29 • 28.1k • 3
lmstudio-community/gemma-3-4B-it-qat-GGUF • Image-Text-to-Text • 4B • Updated Apr 18 • 4.77k • 16
lmstudio-community/gemma-3-12B-it-qat-GGUF • Image-Text-to-Text • 12B • Updated Apr 18 • 5.77k • 9
lmstudio-community/gemma-3-27B-it-qat-GGUF • Image-Text-to-Text • 27B • Updated Apr 18 • 53k • 14
allura-org/Gemma-3-Glitter-27B • Image-Text-to-Text • 27B • Updated Apr 18 • 886 • 5
mlx-community/gemma-3-12b-it-qat-bf16 • Image-Text-to-Text • 13B • Updated Apr 18 • 53 • 1
bullerwins/gemma-3-27b-it-fp8-Dynamic • Image-Text-to-Text • 27B • Updated Apr 27 • 833 • 1
RedHatAI/gemma-3-27b-it-FP8-dynamic • Image-Text-to-Text • 27B • Updated Jun 9 • 16k • 5
xlangai/Jedi-3B-1080p • Image-Text-to-Text • 4B • Updated Jun 18 • 1.44k • 14
ydeng9/OpenVLThinker-7B-v1.2 • Image-Text-to-Text • 8B • Updated 8 days ago • 74 • 3
Nexesenex/Gemma-3-4b_X-Ray-Abli_Linear_v1.01 • Image-Text-to-Text • 4B • Updated May 12 • 41 • 3
hal-utokyo/MangaLMM • Image-Text-to-Text • 8B • Updated Jun 1 • 1.11k • 7
Mungert/UI-TARS-1.5-7B-GGUF • Image-Text-to-Text • 8B • Updated Jun 15 • 2.17k • 6
Ricky06662/TaskRouter-1.5B • Image-Text-to-Text • 2B • Updated Jun 12 • 276 • 2
google/medgemma-4b-pt • Image-Text-to-Text • 4B • Updated May 21 • 3.52k • 107
nvidia/VILA-HD-8B-PS3-1.5K-SigLIP • Image-Text-to-Text • Updated about 24 hours ago • 163 • 3
ngxson/Devstral-Small-Vision-2505-GGUF • Image-Text-to-Text • 24B • Updated May 21 • 439 • 28
numind/NuExtract-2.0-4B • Image-Text-to-Text • 4B • Updated Jun 25 • 1.89k • 13
stockmark/Stockmark-2-VL-100B-beta • Image-Text-to-Text • 96B • Updated Jun 3 • 941 • 21
Tuwhy/Llama-3.2V-11B-Sherlock-iter2 • Image-Text-to-Text • 11B • Updated May 29 • 13 • 2
numind/NuExtract-2.0-2B • Image-Text-to-Text • 2B • Updated Jun 25 • 5.75k • 25