---
title: EOMT Panoptic Segmentation
emoji: 🐠
colorFrom: blue
colorTo: yellow
sdk: gradio
sdk_version: 5.35.0
app_file: app.py
pinned: false
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
# EOMT Panoptic Segmentation App
A Gradio-based web application for interactive panoptic segmentation using the **EOMT (Encoder-only Mask Transformer)** model - a minimalist approach that repurposes a plain Vision Transformer (ViT) for image segmentation.

- GitHub: https://github.com/athrael-soju/little-scripts/tree/main/eomt_panoptic_seg
- Tech Blog: https://Athrael.net

This implementation is based on the CVPR 2025 paper ["Your ViT is Secretly an Image Segmentation Model"](https://www.tue-mps.org/eomt/) by Kerssies et al., which demonstrates that plain Vision Transformers can achieve state-of-the-art segmentation performance.
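
To make the term "panoptic segmentation" concrete, the sketch below shows the general shape of a panoptic result: a map that assigns every pixel a segment id, plus per-segment metadata that distinguishes countable "thing" instances from background "stuff". It is an illustration only and is not taken from this app's code. The toy `segment_map`, the `segments_info` fields (`label`, `is_thing`), and the `summarize` helper are all hypothetical names chosen for the example.

```python
from collections import Counter

# Hypothetical toy output: a 3x4 map of segment ids, one id per pixel.
segment_map = [
    [1, 1, 2, 2],
    [1, 1, 2, 3],
    [0, 0, 0, 3],
]

# Hypothetical per-segment metadata. "Things" are countable instances
# (two separate cars get two ids); "stuff" is uncountable background.
segments_info = {
    0: {"label": "road", "is_thing": False},
    1: {"label": "car", "is_thing": True},
    2: {"label": "car", "is_thing": True},
    3: {"label": "person", "is_thing": True},
}


def summarize(segment_map, segments_info):
    """Return pixel area per segment id and instance count per 'thing' label."""
    # Count how many pixels carry each segment id.
    areas = Counter(seg_id for row in segment_map for seg_id in row)
    # Count distinct segments per label, restricted to 'thing' segments
    # that actually appear in the map.
    instances = Counter(
        info["label"]
        for seg_id, info in segments_info.items()
        if info["is_thing"] and areas[seg_id] > 0
    )
    return dict(areas), dict(instances)


areas, instances = summarize(segment_map, segments_info)
print(areas)
print(instances)
```

Every pixel belongs to exactly one segment, which is what separates panoptic segmentation from semantic segmentation (no instance ids) and instance segmentation (no background classes).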