RLHFlow

university
Activity Feed

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

baohao  updated a collection 10 days ago
Reinforce-Ada
baohao  updated a collection 10 days ago
Reinforce-Ada
baohao  updated a model 10 days ago
RLHFlow/Qwen2.5-Math-1.5B-DAPO-easy
View all activity

RLHFlow 's collections 12

RLHFLow Reward Models
Reward models trained by RLHFlow codebase (https://github.com/RLHFlow/RLHF-Reward-Modeling/)
RLHFLow Reward Models
Reward models trained by RLHFlow codebase (https://github.com/RLHFlow/RLHF-Reward-Modeling/)