📸 Vision - a WestAI-SC Collection

Updated 2 days ago

Vision-related research artifacts of WestAI, including image-only or vision-language models and datasets.

🚀 Caption via Translation

Sleeping

Collection

Models and datasets of the paper "Scaling Laws for Conditional Emergence of Multilingual Image Captioning via Generalization from Translation" • 10 items • Updated Jan 15
ptzld/VLM-GIST

Viewer • Updated Nov 7, 2025 • 65 • 39 • 1
📸 Sa2VA-i

Collection

Sa2VA-i: Improving Sa2VA Results with Consistent Training and Inference • 4 items • Updated 2 days ago
Divs1159/stingbee-7b

Updated Apr 1, 2025 • 55 • 2
📸 HyenaPixel

Collection

Models of the paper "HyenaPixel: Global Image Context with Convolutions" • 10 items • Updated 2 days ago
📸 Group Mamba

Collection

GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model • 3 items • Updated 2 days ago
📸 OpenFlamingo (LAION)

Collection

OpenFlamingo, a family of autoregressive vision-language models ranging from 3B to 9B parameters. • 7 items • Updated 2 days ago
openMaMMUT/openCLIP models and scaling laws

Collection

openMaMMUT/openCLIP models trained on DataComp-1.4B, DFN-1.4B and Re-LAION-2B. Pre-trained models at various scales, including intermediate checkpoints • 11 items • Updated 5 days ago • 1
laion/CLIP-ViT-H-14-laion2B-s32B-b79K

Zero-Shot Image Classification • 1.0B • Updated Jan 22, 2025 • 462k • 449
stanfordmimi/RoentGen-v2-synthetic-dataset

Viewer • Updated Sep 12, 2025 • 565k • 93