TingchenFu/reason_general_3k_qwen-2.5-math-1.5b_05311421 Text Generation • 2B • Updated 20 days ago • 6
TingchenFu/general_reason_3k_qwen-2.5-math-7b_06091810 Text Generation • 8B • Updated 21 days ago • 5
TingchenFu/general_reason_3k_qwen-2.5-math-1.5b_06021434 Text Generation • 2B • Updated 21 days ago • 6
TingchenFu/reason_general_3k_qwen-2.5-math-7b_06021436 Text Generation • 8B • Updated 21 days ago • 7
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.1_bs32lr1.41e-5decay0.0cosine_07070300 Text Classification • Updated Jul 8, 2024 • 6
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.05_bs32lr1.41e-5decay0.0cosine_07070300 Text Classification • Updated Jul 8, 2024 • 6
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.02_bs32lr1.41e-5decay0.0cosine_07070257 Text Classification • Updated Jul 8, 2024 • 6 • 1
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.01_bs32lr1.41e-5decay0.0cosine_07070257 Text Classification • Updated Jul 8, 2024 • 6