e1_code_ms_qwq / all_results.json
End of training
c856acc verified
{
"epoch": 4.982278481012658,
"total_flos": 2.1872555195142144e+18,
"train_loss": 1.0362177829916883,
"train_runtime": 17180.708,
"train_samples_per_second": 9.196,
"train_steps_per_second": 0.072
}
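
The throughput fields above can be combined into a few derived quantities. The following is a minimal sketch, assuming `train_runtime` is measured in seconds and that the two `*_per_second` fields are averages over the whole run (both are assumptions, not stated in the file). The helper name `derive_stats` and the output keys are hypothetical, and because the per-second values are rounded to three decimals, every derived figure is approximate.

```python
import json

# Metrics copied verbatim from all_results.json above.
RESULTS_JSON = """
{
  "epoch": 4.982278481012658,
  "total_flos": 2.1872555195142144e+18,
  "train_loss": 1.0362177829916883,
  "train_runtime": 17180.708,
  "train_samples_per_second": 9.196,
  "train_steps_per_second": 0.072
}
"""


def derive_stats(metrics):
    """Derive approximate run statistics (hypothetical helper).

    Assumes train_runtime is in seconds and the per-second fields
    are run-wide averages.
    """
    runtime = metrics["train_runtime"]
    samples_per_s = metrics["train_samples_per_second"]
    steps_per_s = metrics["train_steps_per_second"]
    return {
        "runtime_hours": runtime / 3600,
        "approx_samples_processed": runtime * samples_per_s,
        "approx_optimizer_steps": runtime * steps_per_s,
        # Ratio of the two rates: roughly the samples consumed per step.
        "approx_samples_per_step": samples_per_s / steps_per_s,
        "approx_flos_per_second": metrics["total_flos"] / runtime,
    }


stats = derive_stats(json.loads(RESULTS_JSON))
for name, value in stats.items():
    print(f"{name}: {value:.4g}")
```

Under these assumptions the samples-per-step ratio is an estimate of the effective global batch size, though the rounding of `train_steps_per_second` limits its precision.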