Hyeongwon/P9-split1_3times_prob_Qwen3-4B-Base_0319-02 Text Generation • 196k • Updated about 16 hours ago • 52
Hyeongwon/P2-split2_bs256_prob_Qwen3-4B-Base_0317-01 Text Generation • 196k • Updated 1 day ago • 113
Hyeongwon/PH_det_sft_FC_swap_labewise_data_oversampling_bf16_lr0.00002_context_12k-Qwen3-8B-Base Text Generation • 308k • Updated 21 days ago • 85
Hyeongwon/PH_prob_sft_FC_swap_labewise_data_oversampling_bf16_lr0.00002_context_12k-Qwen3-8B-Base Text Generation • 308k • Updated 22 days ago • 97