Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
shoaibmohd 's Collections
Computer Use Agent
Learning from examples - training/inference
OCR
Data Analysis Papers

OCR

updated 2 days ago
Upvote
-

  • MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

    Paper • 2509.22186 • Published 27 days ago • 121

  • CommonForms: A Large, Diverse Dataset for Form Field Detection

    Paper • 2509.16506 • Published Sep 20 • 18

  • Automated Structured Radiology Report Generation with Rich Clinical Context

    Paper • 2510.00428 • Published 22 days ago • 7

  • Extract-0: A Specialized Language Model for Document Information Extraction

    Paper • 2509.22906 • Published 27 days ago

  • PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

    Paper • 2510.14528 • Published 7 days ago • 60

  • FineVision: Open Data Is All You Need

    Paper • 2510.17269 • Published 3 days ago • 46

  • RL makes MLLMs see better than SFT

    Paper • 2510.16333 • Published 5 days ago • 39
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs