sravanthib/custom_accelerate-Qwen1.5b-20k-wd-warmup-same-as-nemo • Text Generation • Updated Jul 28 • 4
sravanthib/stage-2-customauto-config-llama-3-2-custom-1000-steps-logging-old-deepspeed • Text Generation • Updated Jul 26 • 4
sravanthib/stage-2-custom-llama-3-2-custom-1000-steps-logging-old-deepspeed • Text Generation • Updated Jul 26 • 4
sravanthib/lr-20k-stage-0-1e-4-Qwen2-5-1-5-B-Instruct-custom-1000-steps-logging-old-deepspeed • Text Generation • Updated Jul 25 • 4
sravanthib/lr-20k-stage-0-1e-4-llama-3-2-1b-custom-1000-steps-logging-old-deepspeed • Text Generation • Updated Jul 25 • 4
sravanthib/lr-1e-4-llama-3-2-1b-custom-1000-steps-logging-old-deepspeed • Text Generation • Updated Jul 25 • 4
sravanthib/lr-3e-6-llama-3-2-1b-custom-1000-steps-logging-old-deepspeed • Text Generation • Updated Jul 24 • 3