Artifacts released with Safety Pretraining
AI & ML interests
None defined yet.
Recent Activity
models 179
locuslab/safelm-1.7b
Updated • 1.06k • 1
locuslab/safelm-1.7b-instruct
2B • Updated • 231 • 1
locuslab/ift_gsm-smollm2-1.7b-all_raw_folders_meta-600B-metamix3p-1k-0
2B • Updated • 2
locuslab/ift-smollm2-1.7b-all_raw_folders_meta-600B-metamix3p-1k-0
2B • Updated • 9
locuslab/base-smollm2-1.7b-all_raw_folders_meta-600B-mbs8-gbs1024-17feb
Updated • 1
locuslab/mix_ift_v9-smollm2-1.7b-score0_rephrase123_mild_ref45_metadata45_10p-600B-metamix3p-1k-0
2B • Updated • 6
locuslab/mix_ift_v9-smollm2-1.7b-score0_rephrase123_mild_ref45_metadata_5p-600B-metamix3p-1k-0
2B • Updated • 7
locuslab/mix_ift_v9-smollm2-1.7b-score0_rephrased_from_beginning_meta-600B-metamix3p-1k-0
2B • Updated • 9
locuslab/mix_ift_v9-smollm2-1.7b-score0_60p_rephrase_ref_and_metadata_5p-600B-metamix3p-1k-0
2B • Updated • 5
locuslab/mix_ift_v9-smollm2-1.7b-score0_60p_rephrase123_mild_ref45_metadata_5p-600B-metamix3p-1k-0
2B • Updated • 2
datasets 12
locuslab/fineweb_annotated
Viewer • Updated • 176M • 1.3k • 2
locuslab/refuseweb
Viewer • Updated • 1.65M • 258 • 1
locuslab/safeweb
Viewer • Updated • 14.8M • 25.2k • 3
locuslab/moral_education
Viewer • Updated • 2.81M • 2.26k • 2
locuslab/jb-completions
Viewer • Updated • 990 • 72 • 1
locuslab/multi_password_eval
Viewer • Updated • 900 • 18
locuslab/password_eval
Viewer • Updated • 500 • 25
locuslab/context_parametric_conflict
Preview • Updated • 13
locuslab/TOFU
Viewer • Updated • 18.1k • 122k • 51
locuslab/safety_data_annotated
Viewer • Updated • 39k • 24