---
title: EOMT Panoptic Segmentation
emoji: ๐
colorFrom: blue
colorTo: yellow
sdk: gradio
sdk_version: 5.35.0
app_file: app.py
pinned: false
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

# EOMT Panoptic Segmentation App

A Gradio web application for interactive panoptic segmentation with the **EOMT (Encoder-only Mask Transformer)** model, a minimalist architecture that repurposes a plain Vision Transformer (ViT) for image segmentation.

GitHub: https://github.com/athrael-soju/little-scripts/tree/main/eomt_panoptic_seg

Tech blog: https://Athrael.net

This implementation is based on the CVPR 2025 paper ["Your ViT is Secretly an Image Segmentation Model"](https://www.tue-mps.org/eomt/) by Kerssies et al., which shows that a plain Vision Transformer, without task-specific segmentation components, can reach accuracy competitive with state-of-the-art segmentation models.
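
To make the term "panoptic segmentation" concrete: the result is commonly represented as a per-pixel map of segment IDs, which an app can color and blend over the input image for display. The snippet below is a minimal NumPy sketch of that visualization step only. It is not taken from this repository's `app.py`; the function names `colorize_segments` and `blend_overlay`, the random palette, and the toy 2x3 segment map are all illustrative assumptions.

```python
import numpy as np


def colorize_segments(segment_ids: np.ndarray, seed: int = 0) -> np.ndarray:
    """Map each segment ID in an (H, W) integer array to an RGB color.

    Hypothetical helper: colors are drawn at random, so two segments may
    occasionally receive similar (or identical) colors.
    """
    rng = np.random.default_rng(seed)
    ids = np.unique(segment_ids)  # sorted unique segment IDs
    palette = rng.integers(0, 256, size=(len(ids), 3), dtype=np.uint8)
    # searchsorted maps every pixel's ID to its row in the palette.
    index = np.searchsorted(ids, segment_ids)
    return palette[index]  # shape (H, W, 3), dtype uint8


def blend_overlay(image: np.ndarray, colors: np.ndarray, alpha: float = 0.5) -> np.ndarray:
    """Alpha-blend an (H, W, 3) color map over an (H, W, 3) uint8 image."""
    blended = (1.0 - alpha) * image.astype(np.float32) + alpha * colors.astype(np.float32)
    return blended.round().astype(np.uint8)


# Toy example: a 2x3 "image" with three segments (IDs 0, 7, and 3).
segments = np.array([[0, 0, 7],
                     [7, 3, 3]])
image = np.full((2, 3, 3), 100, dtype=np.uint8)  # flat gray placeholder image
overlay = blend_overlay(image, colorize_segments(segments), alpha=0.5)
print(overlay.shape)  # (2, 3, 3)
```

In a real pipeline the segment map would come from the model's post-processed output rather than a hand-written array, and a fixed per-class palette may be preferable to random colors.
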