zhang xinming

zhang0212

AI & ML interests

None yet

Organizations

upvoted 2 articles 9 months ago

Article: Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas, +2 • Dec 9, 2022 • 413

Article: Introducing ⚔️ AI vs. AI ⚔️ a deep reinforcement learning multi-agents competition system

CarlCochet, ThomasSimonini • Feb 7, 2023 • 3