KORMo-10B models
KORMo
university
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
KORMo: Korean Open Reasoning Model for Everyone
An open-source hub for Korean language data and model research
🧠 Open Models
- KORMo-Team/KORMo-tokenizer — A tokenizer optimized for bilingual (Korean–English) language representation
- KORMo-Team/KORMo-10B-base — The KORMo-10B pretrained model trained on large-scale Korean and English corpora
- KORMo-Team/KORMo-10B-sft — A fine-tuned model enhanced with long-context reasoning and instruction-following data
- KORMo-Team/KORMo-10B-inst — Final instruction-tuned model with reasoning enhancement and RL (Coming soon; currently awaiting GPU availability)
💡 You can explore the full training history and checkpoints in each model’s
Revisions
tab on Hugging Face.
🌐 Links
- Technical Report — https://arxiv.org/pdf/2510.09426
- Technical Report(Slide-Korean) — https://github.com/MLP-Lab/KORMo-tutorial/blob/main/20251009_MLP_KORMo(Korean).pdf
- Tutorial on Github — https://github.com/MLP-Lab/KORMo-tutorial
- Tutorial on youtube — https://www.youtube.com/@MLPLab
📖 About KORMo
KORMo is an open research initiative dedicated to advancing Korean language understanding and generation through large-scale, fully open-source models and datasets.
We aim to make Korean NLP research transparent, reproducible, and accessible to the global community.
datasets
14
KORMo-Team/UltraFineWeb-ko-synth
Preview
•
Updated
•
170
KORMo-Team/FineWeb2-ko-synth
Preview
•
Updated
•
91
KORMo-Team/Cosmopedia-ko-synth
Preview
•
Updated
•
58
KORMo-Team/NemoPost-ko-synth
Preview
•
Updated
•
47
KORMo-Team/IF-bilingual-sft
Preview
•
Updated
•
274
•
1
KORMo-Team/KORMo-tutorial-datasets
Viewer
•
Updated
•
35k
•
168
•
1
KORMo-Team/KORMo-Self-Introduce
Preview
•
Updated
•
54
•
1
KORMo-Team/NemoPost-ko-translated
Preview
•
Updated
•
46
KORMo-Team/UltraFineWeb-filtered
Preview
•
Updated
•
420
•
1
KORMo-Team/korean-public-corpus
Preview
•
Updated
•
57