Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Nebius AI
Cerebras
Novita
Fireworks
Together AI
Featherless AI
Groq
Nscale
+ 8
Apply filters
Models
8,644
Full-text search
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
reducto/RolmOCR
Image-to-Text
•
8B
•
Updated
Apr 2
•
118k
•
535
PaddlePaddle/PP-OCRv5_server_det
Image-to-Text
•
Updated
Jul 22
•
158k
•
35
Salesforce/blip-image-captioning-large
Image-to-Text
•
0.5B
•
Updated
Feb 3
•
1.26M
•
1.41k
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3
•
1.89M
•
778
PaddlePaddle/PP-OCRv5_mobile_det
Image-to-Text
•
Updated
Jul 22
•
16.8k
•
12
mradermacher/Qwen2.5-VL-7B-Abliterated-Caption-it-GGUF
Image-to-Text
•
8B
•
Updated
Aug 1
•
18k
•
45
microsoft/trocr-base-handwritten
Image-to-Text
•
0.3B
•
Updated
Feb 11
•
433k
•
443
PaddlePaddle/PP-OCRv5_server_rec
Image-to-Text
•
Updated
Jul 22
•
106k
•
12
microsoft/git-base
Image-to-Text
•
0.2B
•
Updated
Apr 24, 2023
•
143k
•
106
ibm-granite/granite-vision-3.2-2b
Image-to-Text
•
3B
•
Updated
Jun 12
•
4k
•
103
Andres77872/SmolVLM-500M-anime-caption-v0.2
Image-to-Text
•
0.5B
•
Updated
May 12
•
881
•
6
numind/NuMarkdown-8B-Thinking
Image-to-Text
•
8B
•
Updated
about 1 month ago
•
8.74k
•
208
sanchit97/chart-rvr-3b
Image-to-Text
•
4B
•
Updated
27 days ago
•
297
•
3
allenai/olmOCR-7B-0825
Image-to-Text
•
8B
•
Updated
Aug 13
•
8.23k
•
29
thesby/Qwen2.5-VL-7B-NSFW-Caption-V3
Image-to-Text
•
8B
•
Updated
Jun 17
•
5.64k
•
65
microsoft/trocr-base-printed
Image-to-Text
•
0.3B
•
Updated
May 27, 2024
•
372k
•
190
microsoft/trocr-small-handwritten
Image-to-Text
•
Updated
May 27, 2024
•
72.3k
•
58
nlpconnect/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Feb 27, 2023
•
387k
•
912
naver-clova-ix/donut-base
Image-to-Text
•
Updated
Aug 13, 2022
•
345k
•
230
naver-clova-ix/donut-base-finetuned-rvlcdip
Image-to-Text
•
Updated
Mar 9, 2024
•
1.8k
•
17
Ransaka/TrOCR-Sinhala
Image-to-Text
•
0.3B
•
Updated
Jan 6, 2024
•
116
•
3
cpans/idcard_ocr
Image-to-Text
•
Updated
Feb 1, 2024
•
1
MohamedRashad/arabic-small-nougat
Image-to-Text
•
0.2B
•
Updated
Nov 28, 2024
•
360
•
25
sashakunitsyn/vlrm-blip2-opt-2.7b
Image-to-Text
•
4B
•
Updated
Apr 3, 2024
•
1.03k
•
18
xtuner/llava-llama-3-8b-v1_1-gguf
Image-to-Text
•
8B
•
Updated
Apr 30, 2024
•
3.15k
•
216
thwri/CogFlorence-2.1-Large
Image-to-Text
•
0.8B
•
Updated
Sep 28, 2024
•
1.7k
•
25
unsloth/Llama-3.2-11B-Vision
Image-to-Text
•
11B
•
Updated
Nov 22, 2024
•
1.19k
•
34
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text
•
6B
•
Updated
Dec 10, 2024
•
9.1k
•
79
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text
•
11B
•
Updated
Dec 10, 2024
•
12.9k
•
85
xiangjx/musk
Image-to-Text
•
Updated
Jan 19
•
36
Previous
1
2
3
...
100
Next