quancute/DPOLlama-3.2-1B-Instruct_sum-chosen5_reject_less2-5k_22Mar-2025_A100 • 1B • Updated Mar 23 • 2
quancute/DPOLlama-3.2-1B-Instruct_sum-chosen5_reject_greater3-20k_22Mar-2025_A100 • 1B • Updated Mar 23 • 3
quancute/DPOLlama-3.2-1B-Instruct_sum-39k_12Mar-2025_A100_new • Text Generation • 1B • Updated Mar 13 • 5