shipeng luo's picture

shipeng luo

luoagent

·

AI & ML interests

ML AI

Recent Activity

upvoted a paper about 21 hours ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

upvoted a paper about 21 hours ago

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

upvoted a paper about 22 hours ago

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

View all activity

Organizations

None yet

models 0

None public yet

datasets 0

None public yet