File size: 5,033 Bytes

---
license: apache-2.0
tags:
- onnx
- paddleocr
- ocr
- computer-vision
- text-recognition
- text-detection
library_name: onnxruntime
pipeline_tag: image-to-text
base_model: 
- PaddlePaddle/PP-OCRv5_server_det
- PaddlePaddle/PP-OCRv5_server_rec
---

# PaddleOCR ONNX Models

PaddleOCR PP-OCRv5 models converted to ONNX format for efficient OCR inference.

[English](#english) | [中文說明](#中文說明)

## 中文說明

這是 PaddleOCR PP-OCRv5 模型的 ONNX 格式版本，用於高效率的文字辨識推理。

### 模型檔案說明

| 檔案名稱 | 說明 |
|---------|------|
| `PP-OCRv5_server_det_infer.onnx` | 文字檢測模型 - 偵測圖片中文字的位置 |
| `PP-OCRv5_server_rec_infer.onnx` | 文字辨識模型 - 辨識文字內容 |
| `PP-LCNet_x1_0_textline_ori_infer.onnx` | 文字方向分類模型 |
| `PP-LCNet_x1_0_doc_ori_infer.onnx` | 文件方向修正模型 |
| `UVDoc_infer.onnx` | 文件展平模型 |
| `PP-OCRv5_server_rec_infer.yml` | 字符字典配置檔 |

### 快速使用

```python
from huggingface_hub import hf_hub_download

# 下載文字檢測模型
det_model_path = hf_hub_download(
    repo_id="marsena/paddleocr-onnx-models",
    filename="PP-OCRv5_server_det_infer.onnx"
)

# 下載文字辨識模型  
rec_model_path = hf_hub_download(
    repo_id="marsena/paddleocr-onnx-models",
    filename="PP-OCRv5_server_rec_infer.onnx"
)
```

### 一次下載所有模型

```python
from huggingface_hub import snapshot_download

# 下載所有模型檔案到本地資料夾
snapshot_download(
    repo_id="marsena/paddleocr-onnx-models",
    local_dir="./paddleocr_onnx"
)
```

### 支援的語言和文字類型

- **語言支援**: 簡體中文、繁體中文、英文、日文
- **文字類型**: 印刷體、手寫體、直書文字、旋轉文字、彎曲文字
- **輸入格式**: 圖片檔案 (JPEG, PNG)
- **運行環境**: 支援 CPU 和 GPU 推理

---

## English

## Model Files

| File | Description |
|------|-------------|
| `PP-OCRv5_server_det_infer.onnx` | Text detection model |
| `PP-OCRv5_server_rec_infer.onnx` | Text recognition model |
| `PP-LCNet_x1_0_textline_ori_infer.onnx` | Text orientation classification |
| `PP-LCNet_x1_0_doc_ori_infer.onnx` | Document orientation correction |
| `UVDoc_infer.onnx` | Document unwarping |
| `PP-OCRv5_server_rec_infer.yml` | Character dictionary config |

## Source Models

These ONNX models are converted from official PaddlePaddle PP-OCRv5 models:

- **Detection Model**: [PaddlePaddle/PP-OCRv5_server_det](https://huggingface.co/PaddlePaddle/PP-OCRv5_server_det)
- **Recognition Model**: [PaddlePaddle/PP-OCRv5_server_rec](https://huggingface.co/PaddlePaddle/PP-OCRv5_server_rec)
- **Official Documentation**: [PP-OCRv5 Introduction](https://paddlepaddle.github.io/PaddleOCR/main/en/version3.x/algorithm/PP-OCRv5/PP-OCRv5.html)

## Usage

### Download Specific Model

```python
from huggingface_hub import hf_hub_download

# Download detection model
det_model_path = hf_hub_download(
    repo_id="marsena/paddleocr-onnx-models",
    filename="PP-OCRv5_server_det_infer.onnx"
)

# Download recognition model  
rec_model_path = hf_hub_download(
    repo_id="marsena/paddleocr-onnx-models",
    filename="PP-OCRv5_server_rec_infer.onnx"
)
```

### Download All Models

```python
from huggingface_hub import snapshot_download

# Download all model files to local directory
snapshot_download(
    repo_id="marsena/paddleocr-onnx-models",
    local_dir="./paddleocr_onnx"
)
```

### ONNX Runtime Inference

```python
import onnxruntime as ort
import numpy as np

# Load model
session = ort.InferenceSession("PP-OCRv5_server_det_infer.onnx")

# Run inference
input_name = session.get_inputs()[0].name
output = session.run(None, {input_name: input_data})
```

## Model Specifications

- **Languages**: Simplified Chinese, Traditional Chinese, English, Japanese
- **Text Types**: Printed text, handwriting, vertical text, rotated text
- **Input Format**: Images (JPEG, PNG)
- **Output Format**: Bounding boxes + recognized text
- **Runtime**: ONNX Runtime 1.16+
- **Hardware**: CPU and GPU inference supported

## License

These models follow the **Apache License 2.0**, consistent with the original PaddleOCR project.

- **PaddleOCR Repository**: https://github.com/PaddlePaddle/PaddleOCR
- **License Details**: [Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0)

## Conversion Information

- **Conversion Tool**: Paddle2ONNX
- **ONNX Version**: 1.12+
- **Source Framework**: PaddlePaddle 2.5+
- **Conversion Date**: January 2025

## Citation

If you use these models in your research, please cite the original PaddleOCR paper:

```bibtex
@misc{paddleocr2020,
    title={PaddleOCR: Awesome multilingual OCR toolkits},
    author={PaddlePaddle Authors},
    howpublished = {\url{https://github.com/PaddlePaddle/PaddleOCR}},
    year={2020}
}
```

## Issues

For model usage issues, please report to the original PaddleOCR repository:
- **PaddleOCR Issues**: https://github.com/PaddlePaddle/PaddleOCR/issues