rootsautomation/GutenOCR-7B
Image-Text-to-Text • Updated
• 824 • 25
VLMs and long context, document processing and understanding, confidence, calibration, alignment, and decision making.
GutenOCR: A Grounded Vision-Language Front-End for Documents
PubMed-OCR: PMC Open Access OCR Annotations