Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
-
PaddlePaddle/PaddleOCR-VL-1.5
Image-Text-to-Text β’ 1.0B β’ Updated β’ 5.66k β’ 334 -
PaddleOCR-VL-1.5 Online Demo
π»45PaddleOCR-VL-1.5_Online_Demo
-
PaddlePaddle/PP-DocLayoutV3
Image Segmentation β’ Updated β’ 2.78k β’ 22 -
PaddlePaddle/PP-DocLayoutV3_safetensors
Object Detection β’ Updated β’ 1.18k β’ 10