Sean Lee
---
license: mit
language:
- ko
pipeline_tag: visual-question-answering
tags:
- beit3
---
# BEiT3-based Korean VQA Model #
GitHub: https://github.com/SeanJeonghwanLee/KoBEiT3
## Basic information ##
### Model ###
- Base model: beit3_large_indomain_patch16_224 (https://github.com/microsoft/unilm/tree/master/beit3)
  - Best epoch: 8
  - Learning rate: 2e-5
  - Fixed seed: 42
- Tokenizer
  - Korean SentencePiece tokenizer trained on Korean Wikipedia
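
A tokenizer of this kind could be trained roughly as sketched below. This is a minimal sketch, not the exact procedure used for this model: the vocabulary size, model type, character coverage, and the names `build_trainer_args`, `train_tokenizer`, `kowiki.txt`, and `ko_wiki_spm` are all assumptions for illustration. It expects a plain-text corpus with one sentence or paragraph per line and requires the `sentencepiece` package.

```python
from pathlib import Path


def build_trainer_args(corpus_path, model_prefix, vocab_size=32000):
    """Collect keyword arguments for sentencepiece.SentencePieceTrainer.train."""
    return {
        "input": str(Path(corpus_path)),   # plain-text corpus, one sentence per line
        "model_prefix": model_prefix,      # writes <prefix>.model and <prefix>.vocab
        "vocab_size": vocab_size,          # hypothetical value, not stated in this README
        "model_type": "unigram",           # assumption: the SentencePiece default
        "character_coverage": 0.9995,      # assumption: a common choice for large character sets
    }


def train_tokenizer(corpus_path, model_prefix="ko_wiki_spm", vocab_size=32000):
    """Train a SentencePiece model and return the path of the model file."""
    import sentencepiece as spm  # imported lazily so the helper above works without it

    spm.SentencePieceTrainer.train(**build_trainer_args(corpus_path, model_prefix, vocab_size))
    return f"{model_prefix}.model"
```

Calling `train_tokenizer("kowiki.txt")` would write `ko_wiki_spm.model` and `ko_wiki_spm.vocab` to the working directory.
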
### Dataset ###
- KoBEiT3
  - AI Hub Visual Question Answering dataset (시각정보 기반 질의응답) (https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=realm&dataSetSn=104)
    - Note: only Koreans can access this dataset.
- Tokenizer
  - kowiki-latest-pages-articles.xml.bz2 (https://dumps.wikimedia.org/kowiki/latest/)
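
The Wikipedia dump is a bz2-compressed MediaWiki XML file, so it has to be turned into plain text before a tokenizer can be trained on it. The sketch below shows one possible way to do that with the Python standard library only. It is an assumption about the preprocessing, not the script used for this model: the function name `extract_text_lines` is hypothetical, and the code does not strip wiki markup (a dedicated extractor would be needed for that).

```python
import bz2
import xml.etree.ElementTree as ET


def extract_text_lines(dump_path, out_path):
    """Stream <text> elements from a MediaWiki XML dump into a UTF-8 text file.

    Writes one non-empty line of raw wikitext per output line and returns
    the number of lines written.
    """
    n_lines = 0
    with bz2.open(dump_path, "rb") as src, open(out_path, "w", encoding="utf-8") as dst:
        for _event, elem in ET.iterparse(src, events=("end",)):
            # Tags carry an XML namespace prefix such as "{...}text"; compare the local name.
            if elem.tag.rsplit("}", 1)[-1] == "text" and elem.text:
                for line in elem.text.splitlines():
                    line = line.strip()
                    if line:
                        dst.write(line + "\n")
                        n_lines += 1
            elem.clear()  # drop processed content to limit memory use
    return n_lines
```

For example, `extract_text_lines("kowiki-latest-pages-articles.xml.bz2", "kowiki.txt")` would produce a text file that can be passed to a tokenizer trainer.
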