RLAIF Experimentation Collection Research into RLAIF (Reinforcement Learning from AI feedback) with the goal of Constitutional AI and Sycophancy Resistance. • 4 items • Updated 5 days ago
RLAIF Experimentation Collection Research into RLAIF (Reinforcement Learning from AI feedback) with the goal of Constitutional AI and Sycophancy Resistance. • 4 items • Updated 5 days ago