shipeng luo
luoagent
ยท
AI & ML interests
ML AI
Recent Activity
upvoted a paper about 21 hours ago
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models upvoted a paper about 21 hours ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation upvoted a paper about 22 hours ago
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMsOrganizations
None yet