Hugging Face
Ach0/GCPO-R1-1.5B
Text Generation
Safetensors
English
qwen2
GRPO
DAPO
GCPO
RL
RLVR
conversational
arxiv: 2510.07790
License: mit
main
GCPO-R1-1.5B/tokenizer.json
Commit History
Upload folder using huggingface_hub
2d79974
verified
Ach0 committed 11 days ago