llama-3.1-8b-grpo-v1.2 / tokenizer.json

Commit History

GRPO-trained model from checkpoint-670
f23aa8f
verified

CodCodingCode commited on