VGraf/self_compare_alpacaeval_related_8maxturns_complete_805 Viewer • Updated about 2 hours ago • 3.22k
VGraf/repeat_response_flip_tulu_5maxturns_big_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated Oct 28, 2025 • 21.5k • 4
VGraf/paraphrase_train_dev_8maxturns_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated Oct 27, 2025 • 5.28k • 5
VGraf/general_responses_dev_8maxturns_truncated2048_gpt-4ochosen_gpt-3.5-turbo-0125rejected Viewer • Updated Oct 27, 2025 • 5.19k • 4
VGraf/self-talk_gpt3.5_gpt4o_prefpairs_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated Oct 27, 2025 • 10k • 5
VGraf/repeat_tulu_5maxturns_big_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated Oct 27, 2025 • 4k • 9
VGraf/paraphrase_train_dev_8maxturns_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated Oct 27, 2025 • 4k • 6
VGraf/general_responses_dev_8maxturns_truncated2048_Qwen__Qwen3-32Bchosen_Qwen__Qwen3-0.6Brejected Viewer • Updated Oct 27, 2025 • 4k • 7