RLinf/RLinf-Gr00t-SFT-Spatial (3B)
RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning