Awesome reward models Collection A curated collection of reward models to use with techniques like rejection sampling and RLHF / RLAIF • 4 items • Updated Apr 12, 2024 • 8
Running on T4 Agents 502 RWKV-Gradio-1 💻 502 Generate text responses with a powerful language model