RLHFlow

university

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

Chenlu123 submitted a paper 27 days ago

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

baohao submitted a paper 2 months ago

Self-Hinting Language Models Enhance Reinforcement Learning

baohao updated a collection 6 months ago

View all activity

Papers

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

View all Papers

RLHFlow 's Papers 1

Submitted by

Wei Xiong

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

RLHFlow