Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Cerebras
Together AI
Fireworks
Nebius AI
Novita
Groq
Hyperbolic
Nscale
+ 6
Apply filters
Models
8,415
Full-text search
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
reducto/RolmOCR
Image-to-Text
•
8B
•
Updated
Apr 2
•
119k
•
515
numind/NuMarkdown-8B-Thinking
Image-to-Text
•
8B
•
Updated
21 days ago
•
8.92k
•
204
allenai/olmOCR-7B-0825
Image-to-Text
•
8B
•
Updated
27 days ago
•
6.84k
•
26
microsoft/trocr-base-handwritten
Image-to-Text
•
0.3B
•
Updated
Feb 11
•
591k
•
439
Salesforce/blip-image-captioning-large
Image-to-Text
•
0.5B
•
Updated
Feb 3
•
1.19M
•
1.4k
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3
•
1.9M
•
774
allenai/olmOCR-7B-0225-preview
Image-to-Text
•
8B
•
Updated
22 days ago
•
181k
•
701
microsoft/kosmos-2-patch14-224
Image-to-Text
•
2B
•
Updated
Nov 28, 2023
•
191k
•
179
AdamCodd/donut-receipts-extract
Image-to-Text
•
0.2B
•
Updated
Jan 11
•
14
•
37
RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8
Image-to-Text
•
8B
•
Updated
Apr 3
•
3.53k
•
7
RedHatAI/Qwen2.5-VL-72B-Instruct-quantized.w4a16
Image-to-Text
•
13B
•
Updated
Jul 10
•
1.73k
•
8
ChatDOC/OCRFlux-3B
Image-to-Text
•
4B
•
Updated
Jul 9
•
12.3k
•
347
sugiv/cardvaultplus-500m-gguf
Image-to-Text
•
0.4B
•
Updated
Jul 22
•
183
•
2
asmud/ds4sd-docling-models-onnx
Image-to-Text
•
Updated
7 days ago
•
2
snskrt/sanskrit-ocr-qwen2vl
Image-to-Text
•
2B
•
Updated
3 days ago
•
5
•
2
thesby/Qwen2.5-VL-7B-NSFW-Caption-V3
Image-to-Text
•
8B
•
Updated
Jun 17
•
5.26k
•
58
nlpconnect/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Feb 27, 2023
•
478k
•
911
microsoft/git-base
Image-to-Text
•
0.2B
•
Updated
Apr 24, 2023
•
159k
•
104
agestau/fashion_captioning_v3
Image-to-Text
•
Updated
May 15, 2023
•
7
•
1
xinyu1205/recognize_anything_model
Image-to-Text
•
Updated
Oct 25, 2023
•
49
paragon-AI/blip2-image-to-text
Image-to-Text
•
Updated
Jun 24, 2023
•
1.42k
•
30
mychen76/invoice-and-receipts_donut_v1
Image-to-Text
•
0.2B
•
Updated
Apr 19, 2024
•
1.75k
•
62
Ransaka/TrOCR-Sinhala
Image-to-Text
•
0.3B
•
Updated
Jan 6, 2024
•
476
•
2
MohamedRashad/arabic-small-nougat
Image-to-Text
•
0.2B
•
Updated
Nov 28, 2024
•
429
•
24
fhswf/TrOCR_german_handwritten
Image-to-Text
•
0.6B
•
Updated
Jun 18, 2024
•
1.33k
•
10
matthh/git-image-to-g-code
Image-to-Text
•
0.2B
•
Updated
Jul 13, 2024
•
8
•
6
fhswf/TrOCR_Math_handwritten
Image-to-Text
•
0.6B
•
Updated
Oct 21, 2024
•
238
•
7
Ertugrul/Qwen2-VL-7B-Captioner-Relaxed
Image-to-Text
•
8B
•
Updated
Sep 26, 2024
•
33.3k
•
61
JackChew/Qwen2-VL-2B-OCR
Image-to-Text
•
2B
•
Updated
Dec 29, 2024
•
3.3k
•
15
chatpig/llava-llama3
Image-to-Text
•
8B
•
Updated
Jan 29
•
656
•
3
Previous
1
2
3
...
100
Next