AdGPT / lauguage_model_fine_tuning /ppo_fine_tune_teacher.py

Commit History

ADD: LLM SFT, RLHF and Distillation
c1c9e88

goodmodeler commited on