facebook/dinov3-vitb16-pretrain-lvd1689m Image Feature Extraction • 85.7M • Updated Aug 19, 2025 • 1.41M • 114
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated Aug 19, 2025 • 30.1k • 221
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability Paper • 2504.07416 • Published Apr 10, 2025 • 3
sentence-transformers/all-mpnet-base-v2 Sentence Similarity • 0.1B • Updated Aug 19, 2025 • 29.2M • • 1.27k
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 16 items • Updated Mar 2 • 83