Illustrating Reinforcement Learning from Human Feedback (RLHF) • natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022
Introducing ⚔️ AI vs. AI ⚔️ a deep reinforcement learning multi-agents competition system • CarlCochet, ThomasSimonini • Feb 7, 2023