	---
	license: apache-2.0
	pipeline_tag: image-to-text
	tags:
	- layout
	---

	## anydoclayout
	> document layout detection

	<a href="https://huggingface.co/anyforge/anydoclayout" target="_blank"><img src="https://img.shields.io/badge/%F0%9F%A4%97-HuggingFace-blue"></a>
	<a href="https://www.modelscope.cn/models/anyforge/anydoclayout" target="_blank"><img alt="Static Badge" src="https://img.shields.io/badge/%E9%AD%94%E6%90%AD-ModelScope-blue"></a>
	<a href=""><img src="https://img.shields.io/badge/Python->=3.6-aff.svg"></a>
	<a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>

	```
	_ ____ _ _
	/ \ _ __ _ _\| _ \ ___ ___\| \| __ _ _ _ ___ _ _\| \|_
	/ _ \ \| '_ \\| \| \| \| \| \| \|/ _ \ / __\| \| / _` \| \| \| \|/ _ \\| \| \| \| __\|
	/ ___ \\| \| \| \| \|_\| \| \|_\| \| (_) \| (__\| \|__\| (_\| \| \|_\| \| (_) \| \|_\| \| \|_
	/_/ \_\_\| \|_\|\__, \|____/ \___/ \___\|_____\__,_\|\__, \|\___/ \__,_\|\__\|
	\|___/ \|___/

	```

	- GitHub: [anydoclayout](https://github.com/anyforge/anydoclayout)
	- Hugging Face: [anydoclayout](https://huggingface.co/anyforge/anydoclayout)
	- ModelScope: [anydoclayout](https://www.modelscope.cn/models/anyforge/anydoclayout)

	![](./yolo11s-vis1.jpg)

	## training datasets

	### 1. label info
	```python
	{0: 'header',
	1: 'title',
	2: 'text',
	3: 'table',
	4: 'figure',
	5: 'formula',
	6: 'footer',
	7: 'pagenum'}
	```

	### 2. datasets info

	- train: 841862 (open data: 667426, self-generated: 174436)
	- eval: 5500
	- image size: 1280

	### 3. eval results
	```python
	Class     Images  Instances   Box(P      R)
	all         5500      52274   0.921   0.897
	header      1461       2337    0.92   0.878
	title       2308       5473   0.896   0.893
	text        4149      34156   0.937   0.927
	table       1476       1913   0.946   0.958
	figure      1842       3343    0.94    0.94
	formula      735       1506   0.881   0.876
	footer       745       1157   0.909   0.781
	pagenum     2164       2389   0.938   0.919
	```
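
As a quick sanity check on the eval results, the sketch below recomputes the total instance count from the per-class rows and derives an F1 score from the reported precision and recall. F1 is not part of the original results; it is computed here only as an illustration, and the names `EVAL_ROWS` and `f1_score` are hypothetical.

```python
# Per-class rows copied from the eval table:
# class -> (images, instances, precision, recall)
EVAL_ROWS = {
    "header": (1461, 2337, 0.92, 0.878),
    "title": (2308, 5473, 0.896, 0.893),
    "text": (4149, 34156, 0.937, 0.927),
    "table": (1476, 1913, 0.946, 0.958),
    "figure": (1842, 3343, 0.94, 0.94),
    "formula": (735, 1506, 0.881, 0.876),
    "footer": (745, 1157, 0.909, 0.781),
    "pagenum": (2164, 2389, 0.938, 0.919),
}


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# The per-class instance counts should add up to the "all" row.
total_instances = sum(row[1] for row in EVAL_ROWS.values())
print("total instances:", total_instances)

# Derived F1 per class (an added metric, not reported by the author).
for name, (_, _, p, r) in EVAL_ROWS.items():
    print(f"{name:<8} F1 = {f1_score(p, r):.3f}")
```

The lowest recall in the table belongs to `footer` (0.781), so that class is the one most likely to be missed at the default confidence threshold.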

	### if you want to get the datasets
	- email: christnowx@qq.com


	### how to use

	```python
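	from pathlib import Path
	from ultralytics import YOLO

	# Directory that holds the downloaded weights (placeholder, set to your own path)
	model_dir = 'your model dir'
	modelfile = Path(model_dir).joinpath('anydoclayout-yolo11s-imgsz1280.pt')
	model = YOLO(modelfile)
	# Predict at the image size listed in the dataset info (1280)
	res = model.predict('your img file', imgsz=1280)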
	```
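
The `predict` call returns a list of Ultralytics `Results` objects, one per image. Below is a minimal sketch of turning raw detections into labelled regions, assuming the class ids follow the label mapping listed in the labels section. The helper name `to_regions`, the `min_score` threshold, and the sample numbers are hypothetical; the function works on plain Python lists so it can be tried without a loaded model.

```python
# Class id -> label name, copied from the labels section of this README.
LABELS = {0: 'header', 1: 'title', 2: 'text', 3: 'table',
          4: 'figure', 5: 'formula', 6: 'footer', 7: 'pagenum'}


def to_regions(boxes_xyxy, class_ids, scores, min_score=0.25):
    """Pair each box with its label and drop low-confidence detections.

    Regions are sorted top-to-bottom, then left-to-right, which is only a
    rough reading order and is not reliable for multi-column pages.
    """
    regions = []
    for (x1, y1, x2, y2), cls, score in zip(boxes_xyxy, class_ids, scores):
        if score < min_score:
            continue
        regions.append({
            "label": LABELS[int(cls)],
            "score": float(score),
            "box": (x1, y1, x2, y2),
        })
    regions.sort(key=lambda r: (r["box"][1], r["box"][0]))
    return regions


# Made-up detections for illustration only (not real model output).
sample = to_regions(
    boxes_xyxy=[(50, 400, 900, 700), (50, 100, 900, 160), (50, 900, 900, 940)],
    class_ids=[2.0, 1.0, 6.0],
    scores=[0.93, 0.88, 0.10],
)
for region in sample:
    print(region["label"], region["score"], region["box"])
```

With a real prediction, the three inputs could be taken from `res[0].boxes.xyxy.tolist()`, `res[0].boxes.cls.tolist()`, and `res[0].boxes.conf.tolist()`.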

	### Buy me a coffee

	- WeChat

	<div align="left">
	<img src="./zanshan.jpg" width="30%" height="30%">
	</div>