Papers
		
		
	
	arxiv:2510.17950
		RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies
Published on Oct 20
		Authors:
		
			
			
			
			
			
			
			
			
			
			
			
			
			
			
			
			
			
			
			
			
			
			
		
Abstract
Testing on real machines is indispensable for robotic control algorithms. In the context of learning-based algorithms, especially VLA models, demand for large-scale evaluation, i.e. testing a large number of models on a large number of tasks, is becoming increasingly urgent. However, doing this right is highly non-trivial, especially when scalability and reproducibility is taken into account. In this report, we describe our methodology for constructing RoboChallenge, an online evaluation system to test robotic control algorithms, and our survey of recent state-of-the-art VLA models using our initial benchmark Table30.
Models citing this paper 0
No model linking this paper
Cite arxiv.org/abs/2510.17950 in a model README.md to link it from this page.
				
			Datasets citing this paper 0
No dataset linking this paper
Cite arxiv.org/abs/2510.17950 in a dataset README.md to link it from this page.
				
			Spaces citing this paper 0
No Space linking this paper
Cite arxiv.org/abs/2510.17950 in a Space README.md to link it from this page.
				
			Collections including this paper 0
No Collection including this paper
Add this paper to a
					collection
					to link it from this page.