---
datasets:
- Caraaaaa/non_text_image_captioning
pipeline_tag: image-to-text
---

This is a [GenerativeImage2Text](https://huggingface.co/microsoft/git-base) (GIT) model fine-tuned on [non-text images](https://huggingface.co/datasets/Caraaaaa/non_text_image_captioning) extracted from documents (e.g., PDFs). It analyzes the content of an image and produces a descriptive caption.
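
A minimal usage sketch with the `transformers` library, following the usual GIT captioning pattern. Several things here are assumptions rather than details from this card: `MODEL_ID` points at the base checkpoint linked above as a stand-in and should be replaced with this model's own Hub repository id, the function names are illustrative, and `transformers`, `torch`, and `Pillow` are assumed to be installed.

```python
from typing import Any

# Assumption: stand-in checkpoint (the base model linked above).
# Replace with this fine-tuned model's own Hub repository id.
MODEL_ID = "microsoft/git-base"


def load_captioner(model_id: str = MODEL_ID):
    """Load the processor and model; the heavy import is deferred until needed."""
    from transformers import AutoModelForCausalLM, AutoProcessor

    processor = AutoProcessor.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    return processor, model


def generate_caption(image: Any, processor: Any, model: Any, max_length: int = 50) -> str:
    """Generate a caption for one RGB image with a GIT-style processor/model pair."""
    # Convert the image into the pixel tensor the model expects.
    pixel_values = processor(images=image, return_tensors="pt").pixel_values
    # Autoregressively generate caption token ids.
    generated_ids = model.generate(pixel_values=pixel_values, max_length=max_length)
    # Decode the first (only) sequence back into text.
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0].strip()


def caption_file(path: str, model_id: str = MODEL_ID) -> str:
    """Convenience wrapper: open an image file and return its caption."""
    from PIL import Image

    processor, model = load_captioner(model_id)
    with Image.open(path) as img:
        return generate_caption(img.convert("RGB"), processor, model)
```

For example, `caption_file("extracted_image.png")` (a hypothetical path to an image extracted from a document) would return the generated caption as a string. When captioning many images, it is cheaper to call `load_captioner()` once and reuse the returned pair with `generate_caption`.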

It is part of a [project](https://github.com/caraaaaa/doc_accessibility?tab=readme-ov-file) to build software that processes offline documents (PDF, Word, PowerPoint, etc.) and detects WCAG accessibility issues.

Example document with non-text images:

Extracted image:

Generated caption:

"Indication of correct signature"