Rui Yang
PRO
Ray2333
15 followers · 9 following
https://yangrui2015.github.io
YangRui2015
AI & ML interests
Deep Reinforcement Learning
Recent Activity
Upvoted a paper 3 days ago: Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
Ray2333's datasets (1)
Ray2333/RiC_harmless_helpful
Updated Jul 12, 2024 • 291k • 14