AdversarialRLHF
/

ppo_pythia410m_tldr6.9b_rm410mdata_mergedsft_propprefix

Model card Files Files and versions

ppo_pythia410m_tldr6.9b_rm410mdata_mergedsft_propprefix / tokenizer.json

Muqeeth's picture

Training in progress, step 52

65a6a6f verified 5 months ago

history contribute delete

3.56 MB

File too large to display, you can check the raw version instead.