YI REN
Joshua-Ren
AI & ML interests
LLM, Cognitive science
Recent Activity
upvoted
a
paper
about 19 hours ago
Token Hidden Reward: Steering Exploration-Exploitation in Group Relative
Deep Reinforcement Learning
upvoted
a
paper
1 day ago
SimKO: Simple Pass@K Policy Optimization
upvoted
a
paper
5 months ago
Learning Dynamics of LLM Finetuning
Organizations
None yet