Zachary Bessinger

zbessinger

https://www.zachbessinger.com

AI & ML interests

Multimodal Computer Vision

Recent Activity

liked a model 12 days ago

Qwen/Qwen3-VL-30B-A3B-Instruct

liked a Space 23 days ago

WildVision/vision-arena

upvoted a paper 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

View all activity

Organizations

None yet

liked a model 12 days ago

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 888k • • 514

liked a Space 23 days ago

Vision Arena (Testing VLMs side-by-side)

🖼

558

Display image analysis results

liked 2 Spaces 2 months ago

Open VLM Leaderboard

🌎

971

VLMEvalKit Evaluation Results Collection

Transformers Timeline

🤗

Interactive timeline to explore the 🤗Transformers models

liked a Space 3 months ago

The Smol Training Playbook

📚

2.9k

The secrets to building world-class LLMs

liked a model 3 months ago

zai-org/GLM-4.6-FP8

Text Generation • 358B • Updated Oct 16, 2025 • 11k • • 97

liked 2 models 5 months ago

merve/smol-vision

Image-Text-to-Text • Updated Nov 5, 2025 • 189

kudzueye/boreal-qwen-image

Text-to-Image • Updated Sep 5, 2025 • 7.8k • • 124

liked a model 8 months ago

TIGER-Lab/VLM2Vec-Qwen2VL-7B

Image-to-Text • Updated May 3, 2025 • 4.1k • 10

liked a Space 8 months ago

MMEB Leaderboard

📊

The massive multimodal embedding benchmark

liked a model 8 months ago

DeepGlint-AI/UniME-LLaVA-OneVision-7B

Image-Text-to-Text • 8B • Updated May 7, 2025 • 12 • 3

liked 2 models 12 months ago

ByteDance/Sa2VA-8B

Image-Text-to-Text • 8B • Updated Sep 8, 2025 • 896 • 65

yayayaaa/florence-2-large-ft-moredetailed

Image-to-Text • 0.8B • Updated Dec 13, 2025 • 87 • 15

liked 3 models about 1 year ago

liked 4 Spaces over 1 year ago

Florence2 + SAM2

🔥

515

Segment and caption objects in images and videos

FLUX.1 [Inpainting]

🎨

642

FLUX.1 [Schnell]

🏎

5.03k

Generate unique images from text descriptions

Vgg Heads

🖼