tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_550tmp07_vllmexp3 • Updated Jan 22 • 15k • 9
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_500tmp07_vllmexp3 • Updated Jan 22 • 15k • 8
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_450tmp07_vllmexp3 • Updated Jan 22 • 15k • 5
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_400tmp07_vllmexp3 • Updated Jan 22 • 15k • 4
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_350tmp07_vllmexp3 • Updated Jan 22 • 15k • 4
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_300tmp07_vllmexp3 • Updated Jan 22 • 15k • 4
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_250tmp07_vllmexp3 • Updated Jan 22 • 15k • 4
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_200tmp07_vllmexp3 • Updated Jan 22 • 15k • 3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_550tmp07 • Updated Jan 22 • 15k • 9
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_550tmp10 • Updated Jan 22 • 15k • 8
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_500tmp07 • Updated Jan 22 • 15k • 8
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_500tmp10 • Updated Jan 22 • 15k • 9
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_450tmp07 • Updated Jan 22 • 15k • 5
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_450tmp10 • Updated Jan 22 • 15k • 4
tmpmodelsave/llamasft_math_ift_balanced_moredata_gold_reward_tmp10_vllmexp • Updated Jan 22 • 20k • 5
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_400tmp07 • Updated Jan 22 • 15k • 5
tmpmodelsave/llamasft_math_ift_balanced_moredata_gold_reward_tmp07_vllmexp • Updated Jan 22 • 30k • 3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_400tmp10 • Updated Jan 22 • 15k • 6
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_350tmp07 • Updated Jan 22 • 15k • 6
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_350tmp10 • Updated Jan 22 • 15k • 5
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_300tmp07 • Updated Jan 22 • 15k • 6
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_300tmp10 • Updated Jan 22 • 15k • 6
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_250tmp07 • Updated Jan 22 • 15k • 3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_250tmp10 • Updated Jan 22 • 15k • 3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_200tmp07 • Updated Jan 22 • 15k • 3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_200tmp10 • Updated Jan 22 • 15k • 3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_100tmp07 • Updated Jan 22 • 15k • 4
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_100tmp10 • Updated Jan 22 • 15k • 3
tmpmodelsave/beta05_balanced_type12_sftloss_moredata550tmp07_vllmexp3 • Updated Jan 22 • 15k • 2