---
title: UI Screen Description Generator With Pix2Struct
emoji: 🐨
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 5.28.0
app_file: app.py
pinned: false
license: mit
short_description: Describe UI screenshots in text with Pix2Struct
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
# UI Screen Describer with Pix2Struct
This demo uses Google's `pix2struct-screen2words-large` model to turn UI screenshots into natural-language descriptions.
### Use Cases
- Accessibility
- UI testing
- Automatic documentation
### How it works
Upload a screenshot (e.g., of an app, webpage, or dashboard) and the model generates a text description of it.
Built using Hugging Face Transformers + Gradio.
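
The flow described here can be sketched roughly as follows. This is a minimal illustration, not the Space's actual `app.py`: the Hub id `google/pix2struct-screen2words-large`, the helper names (`load_pix2struct`, `describe_screenshot`, `build_demo`), and the `max_new_tokens` value are assumptions made for the example.

```python
from functools import lru_cache

# Assumed Hub id for the model named in this README.
MODEL_ID = "google/pix2struct-screen2words-large"


@lru_cache(maxsize=1)
def load_pix2struct(model_id: str = MODEL_ID):
    """Load the processor and model once (hypothetical helper).

    transformers is imported lazily because the import and the
    checkpoint download are both slow.
    """
    from transformers import (
        Pix2StructForConditionalGeneration,
        Pix2StructProcessor,
    )

    processor = Pix2StructProcessor.from_pretrained(model_id)
    model = Pix2StructForConditionalGeneration.from_pretrained(model_id)
    return processor, model


def describe_screenshot(image, processor, model, max_new_tokens: int = 50) -> str:
    """Return a short text description of a screenshot (PIL image)."""
    if image is None:
        return "Please upload a screenshot."
    # The processor turns the image into flattened patches for the encoder.
    inputs = processor(images=image, return_tensors="pt")
    # max_new_tokens=50 is an illustrative limit, not a tuned value.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return processor.decode(output_ids[0], skip_special_tokens=True).strip()


def build_demo():
    """Wrap the model in a simple Gradio interface (hypothetical layout)."""
    import gradio as gr

    def predict(image):
        processor, model = load_pix2struct()
        return describe_screenshot(image, processor, model)

    return gr.Interface(
        fn=predict,
        inputs=gr.Image(type="pil", label="Screenshot"),
        outputs=gr.Textbox(label="Description"),
        title="UI Screen Describer with Pix2Struct",
    )
```

With `transformers`, `torch`, and `gradio` installed, calling `build_demo().launch()` would start the app locally. Passing the processor and model into `describe_screenshot` as arguments keeps the captioning step easy to test without downloading the checkpoint.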