Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dongboklee
/
gPRM-14B-merged
like
2
Text Generation
Transformers
Safetensors
English
qwen2
lora
reward-model
conversational
text-generation-inference
arxiv:
2510.00492
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
main
gPRM-14B-merged
Commit History
Update README.md
b0588ae
verified
dongboklee
commited on
15 days ago
Update README.md
e9d822e
verified
dongboklee
commited on
15 days ago
Create README.md
4588abf
verified
dongboklee
commited on
18 days ago
Upload folder using huggingface_hub
20224fb
verified
dongboklee
commited on
22 days ago
initial commit
a73a522
verified
dongboklee
commited on
22 days ago