V-Reflection: Transforming MLLMs from Passive Observers to Active Interrogators Paper • 2604.03307 • Published 11 days ago • 14
PRIMO R1 Collection Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning featuring RL-optimized models, SFT/RL datasets, and cross-domain benchmark • 6 items • Updated 3 days ago • 4
From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation Paper • 2603.15600 • Published 25 days ago • 7 • 3
PRIMO R1 Collection Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning featuring RL-optimized models, SFT/RL datasets, and cross-domain benchmark • 6 items • Updated 3 days ago • 4
From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation Paper • 2603.15600 • Published 25 days ago • 7
PRIMO R1 Collection Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning featuring RL-optimized models, SFT/RL datasets, and cross-domain benchmark • 6 items • Updated 3 days ago • 4
PRIMO R1 Collection Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning featuring RL-optimized models, SFT/RL datasets, and cross-domain benchmark • 6 items • Updated 3 days ago • 4