AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
$π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
-
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora
Reinforcement Learning • 8B • Updated • 281 -
RLinf/RLinf-OpenVLAOFT-ManiSkill-Base-Main
8B • Updated • 279 -
RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning • 8B • Updated • 65 -
RLinf/RLinf-OpenVLAOFT-LIBERO-130
Reinforcement Learning • 8B • Updated • 59 • 2
-
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora
Reinforcement Learning • 8B • Updated • 281 -
RLinf/RLinf-OpenVLAOFT-ManiSkill-Base-Main
8B • Updated • 279 -
RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning • 8B • Updated • 65 -
RLinf/RLinf-OpenVLAOFT-LIBERO-130
Reinforcement Learning • 8B • Updated • 59 • 2
models
70
RLinf/RLinf-Gr00t-RL-Stack-cube
Updated
•
8
RLinf/RLinf-Gr00t-SFT-Stack-cube
3B
•
Updated
•
8
RLinf/WideSeek-R1-4b
Text Generation
•
4B
•
Updated
•
49
•
1
RLinf/RLinf-Pi05-GSEnv-PutCubeOnPlate-V0-SFT
4B
•
Updated
RLinf/RLinf-OpenVLAOFT-RoboTwin-RL-move_can_pot
8B
•
Updated
•
8
RLinf/RLinf-OpenVLAOFT-RoboTwin-RL-lift_pot
8B
•
Updated
•
7
RLinf/RLinf-OpenVLAOFT-RoboTwin-SFT-move_can_pot
8B
•
Updated
•
12
RLinf/RLinf-OpenVLAOFT-RoboTwin-SFT-handover_block
8B
•
Updated
•
15
RLinf/RLinf-OpenVLAOFT-RoboTwin-SFT-lift_pot
8B
•
Updated
•
15
RLinf/RLinf-OpenVLAOFT-RoboTwin-RL-handover_block
Updated