OCR - a shoaibmohd Collection

shoaibmohd 's Collections

Computer Use Agent

Learning from examples - training/inference

OCR

Data Analysis Papers

OCR

updated 2 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published 27 days ago • 121
CommonForms: A Large, Diverse Dataset for Form Field Detection

Paper • 2509.16506 • Published Sep 20 • 18
Automated Structured Radiology Report Generation with Rich Clinical Context

Paper • 2510.00428 • Published 22 days ago • 7
Extract-0: A Specialized Language Model for Document Information Extraction

Paper • 2509.22906 • Published 27 days ago
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published 7 days ago • 60
FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published 3 days ago • 46
RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published 5 days ago • 39