RLHFlow

university

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

baohao  updated a collection 3 days ago
Reinforce-Ada
baohao  updated a model 3 days ago
RLHFlow/Qwen2.5-Math-1.5B-DAPO-easy