Computer Vision - a Norm Collection

Norm 's Collections

Image / Video Gen

Multimodal Language Model

Fundamental Research

Computer Vision

Computer Vision

updated Oct 8, 2024

Do we still need a network for specific computer vision tasks anymore today?

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 120

Note 1. Process video frames one at a time, equipped with a memory attention module to attend to the previous memories of the target object.
facebook/sam2.1-hiera-large

Mask Generation • 0.2B • Updated Aug 15, 2025 • 31.7k • 129