File size: 725 Bytes
e6e3c7d
 
 
 
 
 
 
 
 
 
 
 
 
 
c41fe9c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
---
title: UI Screen Description Generator With Pix2Struct
emoji: 🐨
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 5.28.0
app_file: app.py
pinned: false
license: mit
short_description: Built a vision-language application
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference


# UI Screen Describer with Pix2Struct

This demo uses Google's `pix2struct-screen2words-large` model to turn UI screenshots into natural language descriptions.

### Use Cases
- Accessibility
- UI testing
- Auto documentation

### How it works
Upload any screenshot (e.g., app, webpage, dashboard) and the model will describe it in text.

Built using Hugging Face Transformers + Gradio.