AI & ML interests
None defined yet.
selfcorrexp/llama3_it_8b_tmp07_n3
Viewer
•
Updated
•
15k
•
2
selfcorrexp/llama3_it_8b_tmp10_n3
Viewer
•
Updated
•
15k
•
2
selfcorrexp/llama3_regular_balanced_sft_4_ORM_training
Viewer
•
Updated
•
174k
•
2
selfcorrexp/llama3_regular_NON_balanced_sft_4_ORM_training
Viewer
•
Updated
•
327k
•
1
selfcorrexp/llama3_non_delete_4_ORM_training
Viewer
•
Updated
•
191k
•
4
selfcorrexp/llama3_regular_balanced_sft_chat_format
Viewer
•
Updated
•
174k
•
1
selfcorrexp/llama3_additional_rr40k_non_delete_sft_chat_format
Viewer
•
Updated
•
231k
•
2
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_2
Viewer
•
Updated
•
25.5k
•
1
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_1
Viewer
•
Updated
•
25.5k
•
1
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_2
Viewer
•
Updated
•
21.4k
•
1
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_1
Viewer
•
Updated
•
21.4k
•
2
selfcorrexp/llama31_prompt_first_corr_math1
Viewer
•
Updated
•
60k
•
1
selfcorrexp/llama31_prompt_first_wrong_math2
Viewer
•
Updated
•
118k
•
3
selfcorrexp/llama31_prompt_first_wrong_math1
Viewer
•
Updated
•
110k
•
3
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_2nd_round_prompt
Viewer
•
Updated
•
21.4k
•
7
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_2nd_round_prompt
Viewer
•
Updated
•
25.5k
•
2
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_base
Viewer
•
Updated
•
7.01k
•
2
selfcorrexp/baseline_star_rr80k
Viewer
•
Updated
•
257k
•
1
selfcorrexp/baseline_star_rr8ou0k
selfcorrexp/baseline_star
Viewer
•
Updated
•
176k
•
2
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_base
Viewer
•
Updated
•
15.1k
•
2
selfcorrexp/llama3_additional_rr40k_non_delete_sft
Viewer
•
Updated
•
231k
•
2
selfcorrexp/llama3_non_delete_regular_balanced_sft
Viewer
•
Updated
•
191k
•
2
selfcorrexp/llama3_additional_rr80k_NON_balanced_sft
Viewer
•
Updated
•
407k
•
3
selfcorrexp/llama31_prompt_first_wrong_prompt2
Viewer
•
Updated
•
60.4k
•
1
selfcorrexp/llama31_prompt_first_wrong_prompt1
Viewer
•
Updated
•
60k
•
1
selfcorrexp/llama31_prompt_corr_prompt
Viewer
•
Updated
•
60k
•
4
selfcorrexp/llama3_non_balanced_rr10k_2e6_bz32_ep3tmp07
Viewer
•
Updated
•
15k
•
3
selfcorrexp/llama3_non_balanced_rr10k_2e6_bz32_ep3tmp10
Viewer
•
Updated
•
15k
•
3
selfcorrexp/llama3_v2_rlhflow_math2
Viewer
•
Updated
•
7.5k
•
2