Single Shuffled Data Data shuffled only at the document-level babylm-seqlen/train_100M_128_single_shuffle Viewer • Updated Apr 8 • 1.28M • 4 babylm-seqlen/train_100M_1024_single_shuffle Viewer • Updated Apr 8 • 160k • 4 babylm-seqlen/train_100M_64_single_shuffle Viewer • Updated Apr 8 • 2.56M • 9 babylm-seqlen/train_100M_256_single_shuffle Viewer • Updated Apr 8 • 639k • 4
Double Shuffled Data Data shuffled at both the document-level, and again at the tokenized level babylm-seqlen/train_100M_256 Viewer • Updated Apr 7 • 639k • 12 babylm-seqlen/train_100M_1024 Viewer • Updated Apr 7 • 160k • 14 babylm-seqlen/train_100M_16384 Viewer • Updated Apr 7 • 9.86k • 16 babylm-seqlen/train_100M_4096 Viewer • Updated Apr 7 • 39.8k • 11
Single Shuffled Data Data shuffled only at the document-level babylm-seqlen/train_100M_128_single_shuffle Viewer • Updated Apr 8 • 1.28M • 4 babylm-seqlen/train_100M_1024_single_shuffle Viewer • Updated Apr 8 • 160k • 4 babylm-seqlen/train_100M_64_single_shuffle Viewer • Updated Apr 8 • 2.56M • 9 babylm-seqlen/train_100M_256_single_shuffle Viewer • Updated Apr 8 • 639k • 4
Double Shuffled Data Data shuffled at both the document-level, and again at the tokenized level babylm-seqlen/train_100M_256 Viewer • Updated Apr 7 • 639k • 12 babylm-seqlen/train_100M_1024 Viewer • Updated Apr 7 • 160k • 14 babylm-seqlen/train_100M_16384 Viewer • Updated Apr 7 • 9.86k • 16 babylm-seqlen/train_100M_4096 Viewer • Updated Apr 7 • 39.8k • 11