qwen3-8B-sft-mix-v20250921_005 / all_results.json
rulins's picture
Upload folder using huggingface_hub
dc7a9e1 verified
raw
history blame contribute delete
202 Bytes
{
"epoch": 5.0,
"total_flos": 60301855162368.0,
"train_loss": 0.9732139451163155,
"train_runtime": 2951.2184,
"train_samples_per_second": 1.355,
"train_steps_per_second": 0.012
}