File size: 2,520 Bytes
66201db
 
 
 
 
 
 
 
5f0181c
 
 
 
 
 
 
4d4b7a8
 
 
 
 
 
 
 
 
 
5f0181c
 
 
66201db
80bf371
66201db
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
660f180
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
---
license: apache-2.0
pipeline_tag: image-to-text
tags:
- layout
---

## anydoclayout
> docs layout detection

<a href="https://huggingface.co/anyforge/anydoclayout" target="_blank"><img src="https://img.shields.io/badge/%F0%9F%A4%97-HuggingFace-blue"></a>
<a href="https://www.modelscope.cn/models/anyforge/anydoclayout" target="_blank"><img alt="Static Badge" src="https://img.shields.io/badge/%E9%AD%94%E6%90%AD-ModelScope-blue"></a>
<a href=""><img src="https://img.shields.io/badge/Python->=3.6-aff.svg"></a>
<a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>

```
    _                ____             _                            _
   / \   _ __  _   _|  _ \  ___   ___| |    __ _ _   _  ___  _   _| |_
  / _ \ | '_ \| | | | | | |/ _ \ / __| |   / _` | | | |/ _ \| | | | __|
 / ___ \| | | | |_| | |_| | (_) | (__| |__| (_| | |_| | (_) | |_| | |_
/_/   \_\_| |_|\__, |____/ \___/ \___|_____\__,_|\__, |\___/ \__,_|\__|
               |___/                             |___/

```

- Github:[anydoclayout](https://github.com/anyforge/anydoclayout)
- Hugging Face: [anydoclayout](https://huggingface.co/anyforge/anydoclayout)
- ModelScope: [anydoclayout](https://www.modelscope.cn/models/anyforge/anydoclayout)

![](./yolo11s-vis1.jpg)

## train datasets

### 1. labels info
```python
{0: 'header',
 1: 'title',
 2: 'text',
 3: 'table',
 4: 'figure',
 5: 'formula',
 6: 'footer',
 7: 'pagenum'}
```

### 2. datasets info

- train: 841862 (opendata: 667426, selfgen: 174436)
- eval: 5500
- imgsize:1280

### 3. eval results
```python
Class     Images  Instances      Box(P          R)
   all       5500      52274      0.921      0.897
header       1461       2337       0.92      0.878
 title       2308       5473      0.896      0.893 
  text       4149      34156      0.937      0.927
 table       1476       1913      0.946      0.958
figure       1842       3343       0.94       0.94
formula        735       1506      0.881      0.876
footer        745       1157      0.909      0.781
pagenum       2164       2389      0.938      0.919

```

### if you want to get datasets
- email:christnowx@qq.com


### how to use

```python
from pathlib import Path
from ultralytics import YOLO

modelfile = Path(model_dir).joinpath('anydoclayout-yolo11s-imgsz1280.pt')
model = YOLO(modelfile)
res = model.predict('your img file', imgsz = 1280)

```

### Buy me a coffee

- 微信(WeChat)

<div align="left">
    <img src="./zanshan.jpg" width="30%" height="30%">
</div>