Spaces:
Running
Running

davanstrien
HF Staff
Add support for reasoning trace display from NuMarkdown-8B-Thinking model
34cedd8
How well do VLM-based OCR models handle Victorian theatre playbills? π | |
Last week I shared OCR Time Capsule for comparing traditional vs VLM-based OCR. I've now added some examples from challenging collections: The British Library's Theatrical playbills from Britain and Ireland collection. | |
These 150-year-old documents are brutal for OCR: | |
- Decorative fonts in every size imaginable | |
- Multi-column layouts with text at odd angles | |
- Faded ink and show-through from the reverse | |
- ALL CAPS DRAMATIC ANNOUNCEMENTS!!! | |
For this dataset I used the RolmOCR model from Reducto (processed via HF Jobs - love how easy UV scripts make GPU inference!). The results? The improvements over traditional OCR are even more dramatic than with exam papers. | |
π Explore the app: https://huggingface.co/spaces/davanstrien/ocr-time-capsule | |
π BL Theatre dataset: https://bl.iro.bl.uk/concern/datasets/a8534aff-c8e3-4fc8-adc1-da542080b1e3 | |
I'll continue to work through the suggestions I got last week but feel free to suggest other hairy OCR challenges to compare VLMs vs existing OCR! | |
#DigitalHumanities #OCR #GLAM #BritishLibrary #TheatreHistory |