models 133
Muadil/Llama-3.2-1B-Instruct_sum_DPO_140k_1_20ep_deneme
Text Generation
• 1B • Updated
• 1
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_3ep_4bit
Text Generation
• 1B • Updated
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit
Text Generation
• 1B • Updated
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_2ep_4bit
Text Generation
• 1B • Updated
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep_4bit
Text Generation
• 1B • Updated
Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_1_2ep_4bit
Text Generation
• 1B • Updated
Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_4bit
Text Generation
• 1B • Updated
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit
Text Generation
• 1B • Updated
• 1
Muadil/Llama-3.2-1B-Instruct_sum_KTO_10k_1_2ep_4bit
Text Generation
• 1B • Updated
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep_4bit
Text Generation
• 1B • Updated
datasets 11
Muadil/dpo_formatted_openai_summary
Viewer
• Updated
• 183k • 6
Muadil/dpo_dataset_train_openai_summary
Viewer
• Updated
• 176k • 3
Muadil/ppo_datasets_summary
Viewer
• Updated
• 176k • 17
Muadil/kto_labeled_openai_summary
Viewer
• Updated
• 365k • 4
• 1
Muadil/cleaned_openai_summary_comparisons
Viewer
• Updated
• 183k • 7
Muadil/all_cleaned_openai_summarize_comparisons_train_val
Viewer
• Updated
• 176k • 3
Muadil/all_unique_cleaned_openai_summarize_comparisons_test
Viewer
• Updated
• 6.24k • 7
Muadil/old_all_cleaned_openai_summarize_comparisons_test
Viewer
• Updated
• 6.24k • 4
Muadil/old_all_cleaned_openai_summarize_comparisons_train_val
Viewer
• Updated
• 176k • 4
Muadil/old_all_unique_cleaned_openai_summarize_comparisons
Viewer
• Updated
• 21k • 3