merve's Collections
Multimodal RAG
Updated Sep 5, 2024
vidore/colpali-v1.2 • Visual Document Retrieval • Updated Mar 14 • 33.1k • 109
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Feb 6 • 526k • 1.22k
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Jan 12 • 782k • 436
Qwen/Qwen2-72B-Instruct • Text Generation • 73B • Updated Oct 8, 2024 • 44.8k • 715
openbmb/MiniCPM-V-2_6 • Image-Text-to-Text • 8B • Updated Jun 13 • 78.9k • 995
ColPali • Document Retrieval • Running on Zero • 124
vidore/colpali_train_set • Viewer • Updated Jun 20 • 119k • 2.75k • 82
lmms-lab/llava-onevision-qwen2-7b-ov • Text Generation • 8B • Updated Sep 2, 2024 • 103k • 53
HuggingFaceM4/Idefics3-8B-Llama3 • Image-Text-to-Text • 8B • Updated Dec 2, 2024 • 67.3k • 289