tobil/grpo_output
Tags: Transformers, Safetensors, Generated from Trainer, trl, hf_jobs, grpo, arxiv:2402.03300
Branch: main
Path: grpo_output/ref (21.8 MB)
1 contributor. History: 2 commits
Latest commit: tobil, "tobil/qmd-query-expansion-qwen3.5-2B-grpo", 55859dc (verified), 17 days ago
File                        Size            Last commit                                  Updated
adapter_config.json         1.06 kB         tobil/qmd-query-expansion-qwen3.5-2B-grpo    17 days ago
adapter_model.safetensors   21.8 MB (xet)   tobil/qmd-query-expansion-qwen3.5-2B-grpo    17 days ago