RLHFlow's Collections

Reinforce-Ada

Training and test sets, and fine-tuned models