Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
rzzhan
/
ExGRPO-Llama3.1-8B-Zero
like
0
Safetensors
llama
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
main
ExGRPO-Llama3.1-8B-Zero
/
model-00001-of-00007.safetensors
Commit History
Upload folder using huggingface_hub
c690656
verified
rzzhan
commited on
22 days ago