laterabhi/grpo-sql-optimizer
Reinforcement Learning
Safetensors
qwen2
grpo
sql
optimization
License: mit
main
Commit History
Upload README.md with huggingface_hub
f4501b6
verified
laterabhi committed 9 days ago
Upload grpo_results.png with huggingface_hub
2943355
verified
laterabhi committed 9 days ago
Upload grpo_results.png with huggingface_hub
ec97a4d
verified
laterabhi committed 9 days ago
Upload training_stats.json with huggingface_hub
6a8fcde
verified
laterabhi committed 9 days ago
Upload tokenizer
b6ae4d1
verified
laterabhi committed 9 days ago
Upload Qwen2ForCausalLM
0a30d75
verified
laterabhi committed 9 days ago
initial commit
ec3abba
verified
laterabhi committed 9 days ago