This repository contains an OpenVision model. OpenVision is a fully open, cost-effective family of advanced vision encoders for multimodal learning, described in the paper "OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning".

Project Page: https://ucsc-vlaa.github.io/OpenVision/

Code: https://github.com/UCSC-VLAA/OpenVision

