jkrs
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted a paper 6 months ago
Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
liked a dataset over 1 year ago
Anthropic/hh-rlhf
Organizations
None yet