allenai/reward-bench-2
			Viewer
			• 
	
				Updated
					
				• 
			
			1.87k
	
				• 
					
					5.7k
				
				• 
					
					26
				
Datasets, spaces, and models for Reward Bench 2 benchmark and paper!
Display and analyze reward model evaluation results
 
				 
				 
				 
				 
				 
				