Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

CodCodingCode
/
llama-3.1-8b-GRPO-V2.0

Transformers
TensorBoard
Safetensors
Generated from Trainer
grpo
trl
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-3.1-8b-GRPO-V2.0 / runs /Jul02_20-48-37_192-222-59-149
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
CodCodingCode's picture
CodCodingCode
Upload folder using huggingface_hub
27a72dd verified about 1 month ago
  • events.out.tfevents.1751489318.192-222-59-149.13578.0
    20.5 kB
    xet
    Upload folder using huggingface_hub about 1 month ago