Pre-training datasets
0xDing/wikipedia-cn-20230720-filtered • Updated Jul 23, 2023 • 255k • 2.03k • 170
Skywork/SkyPile-150B • Updated Dec 7, 2023 • 1.76M • 21.9k • 404

Supervised fine-tuning datasets
BelleGroup/train_2M_CN • Updated Apr 8, 2023 • 2M • 1.36k • 110
BelleGroup/train_0.5M_CN • Updated Apr 3, 2023 • 519k • 2.03k • 120
BelleGroup/generated_chat_0.4M • Updated Apr 8, 2023 • 396k • 490 • 68
BelleGroup/school_math_0.25M • Updated Apr 8, 2023 • 248k • 456 • 105