MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper • 2509.22186 • Published 27 days ago • 121
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper • 2509.16506 • Published Sep 20 • 18
Automated Structured Radiology Report Generation with Rich Clinical Context Paper • 2510.00428 • Published 22 days ago • 7
Extract-0: A Specialized Language Model for Document Information Extraction Paper • 2509.22906 • Published 26 days ago
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published 7 days ago • 60