Xiangmin Yi
lazyyxm
ยท
AI & ML interests
RL
LLM
Recent Activity
upvoted a paper 27 days ago
RLinf-Co: Reinforcement Learning-Based Sim-Real Co-Training for VLA Models upvoted a paper about 1 month ago
RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI upvoted a paper about 1 month ago
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Organizations
None yet