Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dnotitia
's Collections
DNA 2.1
Qwen3 Modified (chat_template)
DNA 2.0
DNA 2.0 (RC2)
DNA 2.0 (RC1)
DNA-R1
DNA 1.0
HMC
Smoothie Qwen3
Smoothie Qwen2.5
Private Models
Private Datasets (DNA 2.0)
Private Datasets (DNA 2.0 Evaluation)
Private Datasets (Qwen3 Korean)
Private Datasets (SFT)
Private Datasets (CoT)
Private Datasets (Only Answer)
Private Datasets (MATH)
Private Datasets (Reasoning, ko)
Private Datasets (Reasoning, en)
Private Datasets (CPT)
Private Datasets (DPO)
Private Datasets (Coding)
Private Datasets (RL, GRPO)
Private Datasets (Smoothie Qwen)
DNA 2.1
updated
10 days ago
Making Qwen3 Think in Korean with Reinforcement Learning https://arxiv.org/abs/2508.10355
Upvote
-
Making Qwen3 Think in Korean with Reinforcement Learning
Paper
•
2508.10355
•
Published
Aug 14
Upvote
-
Share collection
View history
Collection guide
Browse collections