Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
laterabhi
/
grpo-sql-optimizer
like
0
Reinforcement Learning
Safetensors
qwen2
grpo
sql
optimization
License:
mit
Model card
Files
Files and versions
xet
Community
main
grpo-sql-optimizer
1,000 MB
Ctrl+K
Ctrl+K
1 contributor
History:
7 commits
laterabhi
Upload README.md with huggingface_hub
f4501b6
verified
9 days ago
.gitattributes
Safe
1.62 kB
Upload grpo_results.png with huggingface_hub
9 days ago
README.md
565 Bytes
Upload README.md with huggingface_hub
9 days ago
chat_template.jinja
Safe
2.51 kB
Upload tokenizer
9 days ago
config.json
Safe
1.28 kB
Upload Qwen2ForCausalLM
9 days ago
generation_config.json
Safe
241 Bytes
Upload Qwen2ForCausalLM
9 days ago
grpo_results.png
118 kB
xet
Upload grpo_results.png with huggingface_hub
9 days ago
model.safetensors
988 MB
xet
Upload Qwen2ForCausalLM
9 days ago
tokenizer.json
Safe
11.4 MB
xet
Upload tokenizer
9 days ago
tokenizer_config.json
Safe
664 Bytes
Upload tokenizer
9 days ago
training_stats.json
420 Bytes
Upload training_stats.json with huggingface_hub
9 days ago