AI & ML interests
None defined yet.
selfcorrexp2/llama31_ace_kumar_testtmp07
Viewer
•
Updated
•
15k
•
13
selfcorrexp2/llama31_ace_kumar_testtmp10
Viewer
•
Updated
•
15k
•
7
selfcorrexp2/balanced_model_as_rm_2prompt
Viewer
•
Updated
•
5k
•
12
•
1
selfcorrexp2/balanced_model_as_rm
Viewer
•
Updated
•
5k
•
9
selfcorrexp2/selfcorrexp2_llama3_openmath_1m_ep1_tmp10_goldrm_labeled
Viewer
•
Updated
•
15k
•
8
selfcorrexp2/HanningZhang_Llama3-sft-more-corr-rr60k-3ep_moredatatmp10_vllmexp3
Viewer
•
Updated
•
15k
•
14
selfcorrexp2/HanningZhang_Llama3-sft-more-corr-rr60k-3ep_moredatatmp10
Viewer
•
Updated
•
15k
•
9
selfcorrexp2/HanningZhang_Llama3-sft-more-corr-rr60k-3ep_moredatatmp10_gold_reward
Viewer
•
Updated
•
15k
•
9
selfcorrexp2/balanced_self_rewarding_rm_labeled_llama3_sft_gen_1round_prompt
Viewer
•
Updated
•
15k
•
9
selfcorrexp2/llama3_sft_more_corr_rr0k_3ep_more_datatmp10_vllmexp3
Viewer
•
Updated
•
15k
•
9
selfcorrexp2/llama3_sft_more_corr_rr0k_3ep_more_datatmp10
Viewer
•
Updated
•
15k
•
7
selfcorrexp2/balanced_self_rewarding_rm_labeled_llama3_sft_gen_1round
Viewer
•
Updated
•
15k
•
7
selfcorrexp2/llama3_sft_balanced_corr_rr0k_ep3_train_on_reasoning_more_datatmp10_vllmexp3
Viewer
•
Updated
•
15k
•
9
selfcorrexp2/llama3_sft_less_corr_rr0k_ep3_train_on_reasoning_more_datatmp10_vllmexp3
Viewer
•
Updated
•
15k
•
7
selfcorrexp2/llama3_sft_balanced_corr_rr0k_ep3_train_on_reasoning_more_datatmp10
Viewer
•
Updated
•
15k
•
7
selfcorrexp2/llama3_sft_less_corr_rr0k_ep3_train_on_reasoning_more_datatmp10
Viewer
•
Updated
•
15k
•
7
selfcorrexp2/llama3_sft_balanced_gen2_math_type3
Viewer
•
Updated
•
4.16k
•
4
selfcorrexp2/llama3_sft_balanced_gen2_math_type4
Viewer
•
Updated
•
3.15k
•
8
selfcorrexp2/llama3_sft_balanced_gen2_augmath_type4
Viewer
•
Updated
•
3.8k
•
7
selfcorrexp2/llama3_sft_balanced_gen2_augmath_type3
Viewer
•
Updated
•
4.36k
•
14
selfcorrexp2/llama3_sft_balanced_gen2_mix_type12
Viewer
•
Updated
•
25.2k
•
8
selfcorrexp2/llama3_sft_balanced_gen2_augmath_type12
Viewer
•
Updated
•
12.8k
•
7
selfcorrexp2/llama3_sft_balanced_gen2_math_type12
Viewer
•
Updated
•
12.4k
•
4
selfcorrexp2/llama3_sft_balanced_gen2_math_
Viewer
•
Updated
•
22k
•
5
selfcorrexp2/llama3_sft_balanced_gen2_augmath_
Viewer
•
Updated
•
27.2k
•
8
selfcorrexp2/balancedsft_augmath_sft_gen2_prompt
Viewer
•
Updated
•
27.2k
•
6
selfcorrexp2/balancedsft_math_sft_gen2_prompt
Viewer
•
Updated
•
22k
•
5
selfcorrexp2/llama3_sft_balanced_gen1_math_
Viewer
•
Updated
•
7.5k
•
5
selfcorrexp2/llama3_sft_balanced_gen1_augmath_
Viewer
•
Updated
•
7.57k
•
7
selfcorrexp2/llama3_sft_ift_morecorr_more_datatmp07_vllmexp
Viewer
•
Updated
•
30k
•
5