Update/Upload model card for LoRA
README.md
CHANGED
@@ -1,3 +1,41 @@
+---
+license: apache-2.0 # or whichever license you choose, e.g. mit, cc-by-sa-4.0
+tags:
+- text-synthesis
+- scene-text
+- multilingual
+- diffusion
+- dit
+- ocr-free
+- textflux
+- flux # if your model is based on FLUX
+# - text-to-image # a general computer-vision tag
+# - generated_image_text # a more specific tag
+library_name: diffusers # because you mentioned Diffusers
+pipeline_tag: text-to-image # or a more specific task tag
+# datasets: # optionally list the main training datasets, even if they have not been publicly released yet
+# - your-custom-training-dataset-name
+# metrics: # if you have evaluation metrics
+# - fid
+# - ocr_accuracy
+# model-index: # this section helps Hugging Face index the model and its results
+# - name: TextFlux # your model name
+# results:
+# - task:
+# type: text-to-image # task type
+# name: Scene Text Synthesis # specific name of the task
+# dataset: # dataset used for evaluation
+# name: your-evaluation-dataset
+# type: scene_text_images
+# metrics: # evaluation metrics
+# - name: OCR Accuracy
+# value: 90.5 # example
+# type: ocr_accuracy
+# - name: FID
+# value: 30.2 # example
+# type: fid
+---
+
 # TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis
 
 <div style="display: flex; justify-content: center; align-items: center;">
@@ -67,12 +105,6 @@ Provides "Normal Mode" (for pre-combined inputs) and "Custom Mode" (upload scene
 python demo.py
 ```
 
-## TODO
-
-- [ ] Release the training datasets and testing datasets
-- [ ] Release the training scripts
-- [ ] Release the eval scripts
-- [ ] Support comfyui
 
 ## Acknowledgement
 
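
The front matter added in this commit is the YAML metadata block that the Hub reads from the top of README.md. As a minimal sketch (not part of the commit), the Python below splits a README into front matter and body and collects the top-level scalar keys. `split_front_matter` and `top_level_scalars` are hypothetical helper names, the sample README is an abbreviated stand-in for the real file, and the parser handles only flat `key: value` lines with `#` comments, not full YAML.

```python
def split_front_matter(text):
    """Return (front_matter, body); front_matter is None without a leading '---' block."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return None, text
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            return "\n".join(lines[1:i]), "\n".join(lines[i + 1:])
    return None, text  # opening delimiter without a closing one


def top_level_scalars(front_matter):
    """Collect unindented `key: value` pairs; skip comments, list items, empty values."""
    result = {}
    for raw in front_matter.splitlines():
        if not raw or raw[0] in " \t#-":
            continue  # blank line, indented line, comment, or list item
        key, sep, value = raw.partition(":")
        if not sep:
            continue
        value = value.split(" #", 1)[0].strip()  # drop a trailing inline comment
        if value:
            result[key.strip()] = value
    return result


# Abbreviated stand-in for the README touched by this commit (assumption).
README = """---
license: apache-2.0 # or whichever license applies
tags:
- text-synthesis
- scene-text
library_name: diffusers
pipeline_tag: text-to-image
---

# TextFlux
"""

front, body = split_front_matter(README)
meta = top_level_scalars(front)
print(meta)
```

For anything beyond a quick check, a real YAML parser is the safer choice, since this sketch ignores nested keys such as the commented-out `model-index` section.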