Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
YI REN
Joshua-Ren
Follow
https://joshua-ren.github.io/
Joshua-Ren
AI & ML interests
LLM, Cognitive science
Recent Activity
upvoted
a
paper
about 7 hours ago
Token Hidden Reward: Steering Exploration-Exploitation in Group Relative Deep Reinforcement Learning
upvoted
a
paper
about 21 hours ago
SimKO: Simple Pass@K Policy Optimization
upvoted
a
paper
5 months ago
Learning Dynamics of LLM Finetuning
View all activity
Organizations
None yet
Papers
1
arxiv:
2407.10490
models
0
None public yet
datasets
0
None public yet