LitBench SAA-Lab/LitBench-Train Viewer • Updated 27 days ago • 43.8k • 362 • 1 SAA-Lab/LitBench-Rationales Viewer • Updated May 16 • 43.7k • 80 SAA-Lab/LitBench-Test Viewer • Updated 27 days ago • 2.38k • 89 LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing Paper • 2507.00769 • Published Jul 1 • 4
LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing Paper • 2507.00769 • Published Jul 1 • 4
UltraSuite SAA-Lab/Qwen2.5-Omni-7B-UltraSuite 11B • Updated May 9 SAA-Lab/Qwen2.5-Omni-3B-UltraSuite 6B • Updated May 10 • 3 SAA-Lab/Qwen2-Audio-7B-Instruct-Ultrasuite 8B • Updated May 10 • 5 • 2 SAA-Lab/Qwen2.5-Omni-7B-UltraSuite-woA 11B • Updated May 14 • 3
Creative-Writing-Verifier Data and model for SAA-Lab/writingprompts-pairwise-test Viewer • Updated Mar 9 • 1k • 4 SAA-Lab/writingprompts-pairwise-train Viewer • Updated Mar 9 • 19.7k • 4 SAA-Lab/wp_test_0421 Viewer • Updated Apr 21 • 6.22k • 3 • 1 SAA-Lab/wp_train_0421 Viewer • Updated Apr 21 • 48.8k • 4
SLPHelmDatasets SAA-Lab/SLPHelmDataset Viewer • Updated May 15 • 19.4k • 1.1k SAA-Lab/SLPHelmUltraSuite Viewer • Updated May 16 • 7.54k • 140 SAA-Lab/SLPHelmManualLabels Viewer • Updated May 14 • 926 • 182
WP Test SAA-Lab/test_jan25 Viewer • Updated May 9 • 155 • 3 SAA-Lab/test-jan24 Viewer • Updated May 9 • 796 • 3 SAA-Lab/test_march23 Viewer • Updated May 9 • 1.9k • 3 SAA-Lab/test_oct23 Viewer • Updated May 9 • 1k • 3
LitBench SAA-Lab/LitBench-Train Viewer • Updated 27 days ago • 43.8k • 362 • 1 SAA-Lab/LitBench-Rationales Viewer • Updated May 16 • 43.7k • 80 SAA-Lab/LitBench-Test Viewer • Updated 27 days ago • 2.38k • 89 LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing Paper • 2507.00769 • Published Jul 1 • 4
LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing Paper • 2507.00769 • Published Jul 1 • 4
SLPHelmDatasets SAA-Lab/SLPHelmDataset Viewer • Updated May 15 • 19.4k • 1.1k SAA-Lab/SLPHelmUltraSuite Viewer • Updated May 16 • 7.54k • 140 SAA-Lab/SLPHelmManualLabels Viewer • Updated May 14 • 926 • 182
UltraSuite SAA-Lab/Qwen2.5-Omni-7B-UltraSuite 11B • Updated May 9 SAA-Lab/Qwen2.5-Omni-3B-UltraSuite 6B • Updated May 10 • 3 SAA-Lab/Qwen2-Audio-7B-Instruct-Ultrasuite 8B • Updated May 10 • 5 • 2 SAA-Lab/Qwen2.5-Omni-7B-UltraSuite-woA 11B • Updated May 14 • 3
WP Test SAA-Lab/test_jan25 Viewer • Updated May 9 • 155 • 3 SAA-Lab/test-jan24 Viewer • Updated May 9 • 796 • 3 SAA-Lab/test_march23 Viewer • Updated May 9 • 1.9k • 3 SAA-Lab/test_oct23 Viewer • Updated May 9 • 1k • 3
Creative-Writing-Verifier Data and model for SAA-Lab/writingprompts-pairwise-test Viewer • Updated Mar 9 • 1k • 4 SAA-Lab/writingprompts-pairwise-train Viewer • Updated Mar 9 • 19.7k • 4 SAA-Lab/wp_test_0421 Viewer • Updated Apr 21 • 6.22k • 3 • 1 SAA-Lab/wp_train_0421 Viewer • Updated Apr 21 • 48.8k • 4