anydoclayout / README.md
qianliyx's picture
Update README.md
660f180 verified
---
license: apache-2.0
pipeline_tag: image-to-text
tags:
- layout
---
## anydoclayout
> docs layout detection
<a href="https://huggingface.co/anyforge/anydoclayout" target="_blank"><img src="https://img.shields.io/badge/%F0%9F%A4%97-HuggingFace-blue"></a>
<a href="https://www.modelscope.cn/models/anyforge/anydoclayout" target="_blank"><img alt="Static Badge" src="https://img.shields.io/badge/%E9%AD%94%E6%90%AD-ModelScope-blue"></a>
<a href=""><img src="https://img.shields.io/badge/Python->=3.6-aff.svg"></a>
<a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>
```
_ ____ _ _
/ \ _ __ _ _| _ \ ___ ___| | __ _ _ _ ___ _ _| |_
/ _ \ | '_ \| | | | | | |/ _ \ / __| | / _` | | | |/ _ \| | | | __|
/ ___ \| | | | |_| | |_| | (_) | (__| |__| (_| | |_| | (_) | |_| | |_
/_/ \_\_| |_|\__, |____/ \___/ \___|_____\__,_|\__, |\___/ \__,_|\__|
|___/ |___/
```
- Github:[anydoclayout](https://github.com/anyforge/anydoclayout)
- Hugging Face: [anydoclayout](https://huggingface.co/anyforge/anydoclayout)
- ModelScope: [anydoclayout](https://www.modelscope.cn/models/anyforge/anydoclayout)
![](./yolo11s-vis1.jpg)
## train datasets
### 1. labels info
```python
{0: 'header',
1: 'title',
2: 'text',
3: 'table',
4: 'figure',
5: 'formula',
6: 'footer',
7: 'pagenum'}
```
### 2. datasets info
- train: 841862 (opendata: 667426, selfgen: 174436)
- eval: 5500
- imgsize:1280
### 3. eval results
```python
Class Images Instances Box(P R)
all 5500 52274 0.921 0.897
header 1461 2337 0.92 0.878
title 2308 5473 0.896 0.893
text 4149 34156 0.937 0.927
table 1476 1913 0.946 0.958
figure 1842 3343 0.94 0.94
formula 735 1506 0.881 0.876
footer 745 1157 0.909 0.781
pagenum 2164 2389 0.938 0.919
```
### if you want to get datasets
- email:christnowx@qq.com
### how to use
```python
from pathlib import Path
from ultralytics import YOLO
modelfile = Path(model_dir).joinpath('anydoclayout-yolo11s-imgsz1280.pt')
model = YOLO(modelfile)
res = model.predict('your img file', imgsz = 1280)
```
### Buy me a coffee
- 微信(WeChat)
<div align="left">
<img src="./zanshan.jpg" width="30%" height="30%">
</div>