|
--- |
|
license: apache-2.0 |
|
pipeline_tag: image-to-text |
|
tags: |
|
- layout |
|
--- |
|
|
|
## anydoclayout |
|
> docs layout detection |
|
|
|
<a href="https://huggingface.co/anyforge/anydoclayout" target="_blank"><img src="https://img.shields.io/badge/%F0%9F%A4%97-HuggingFace-blue"></a> |
|
<a href="https://www.modelscope.cn/models/anyforge/anydoclayout" target="_blank"><img alt="Static Badge" src="https://img.shields.io/badge/%E9%AD%94%E6%90%AD-ModelScope-blue"></a> |
|
<a href=""><img src="https://img.shields.io/badge/Python->=3.6-aff.svg"></a> |
|
<a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a> |
|
|
|
``` |
|
_ ____ _ _ |
|
/ \ _ __ _ _| _ \ ___ ___| | __ _ _ _ ___ _ _| |_ |
|
/ _ \ | '_ \| | | | | | |/ _ \ / __| | / _` | | | |/ _ \| | | | __| |
|
/ ___ \| | | | |_| | |_| | (_) | (__| |__| (_| | |_| | (_) | |_| | |_ |
|
/_/ \_\_| |_|\__, |____/ \___/ \___|_____\__,_|\__, |\___/ \__,_|\__| |
|
|___/ |___/ |
|
|
|
``` |
|
|
|
- Github:[anydoclayout](https://github.com/anyforge/anydoclayout) |
|
- Hugging Face: [anydoclayout](https://huggingface.co/anyforge/anydoclayout) |
|
- ModelScope: [anydoclayout](https://www.modelscope.cn/models/anyforge/anydoclayout) |
|
|
|
 |
|
|
|
## train datasets |
|
|
|
### 1. labels info |
|
```python |
|
{0: 'header', |
|
1: 'title', |
|
2: 'text', |
|
3: 'table', |
|
4: 'figure', |
|
5: 'formula', |
|
6: 'footer', |
|
7: 'pagenum'} |
|
``` |
|
|
|
### 2. datasets info |
|
|
|
- train: 841862 (opendata: 667426, selfgen: 174436) |
|
- eval: 5500 |
|
- imgsize:1280 |
|
|
|
### 3. eval results |
|
```python |
|
Class Images Instances Box(P R) |
|
all 5500 52274 0.921 0.897 |
|
header 1461 2337 0.92 0.878 |
|
title 2308 5473 0.896 0.893 |
|
text 4149 34156 0.937 0.927 |
|
table 1476 1913 0.946 0.958 |
|
figure 1842 3343 0.94 0.94 |
|
formula 735 1506 0.881 0.876 |
|
footer 745 1157 0.909 0.781 |
|
pagenum 2164 2389 0.938 0.919 |
|
|
|
``` |
|
|
|
### if you want to get datasets |
|
- email:christnowx@qq.com |
|
|
|
|
|
### how to use |
|
|
|
```python |
|
from pathlib import Path |
|
from ultralytics import YOLO |
|
|
|
modelfile = Path(model_dir).joinpath('anydoclayout-yolo11s-imgsz1280.pt') |
|
model = YOLO(modelfile) |
|
res = model.predict('your img file', imgsz = 1280) |
|
|
|
``` |
|
|
|
### Buy me a coffee |
|
|
|
- 微信(WeChat) |
|
|
|
<div align="left"> |
|
<img src="./zanshan.jpg" width="30%" height="30%"> |
|
</div> |
|
|