ToolRL: Reward is All Tool Learning Needs
emre can PRO
emrecanacikgoz
AI & ML interests
None yet
Organizations
models
18

emrecanacikgoz/Qwen2.5-7B-Instruct-ToolRL-grpo-cold
8B
•
Updated
•
16
•
2

emrecanacikgoz/lorem-sft400-only
8B
•
Updated
•
3

emrecanacikgoz/lorem-base
8B
•
Updated
•
3

emrecanacikgoz/loremppo-sft-400
8B
•
Updated
•
3

emrecanacikgoz/lorem-sft-400
8B
•
Updated
•
3

emrecanacikgoz/SMARTAgent-Mistral-Small-24B-Instruct-2501
24B
•
Updated
•
2

emrecanacikgoz/SMARTAgent-Mistral-Nemo-Instruct-2407
12B
•
Updated
•
2
•
1

emrecanacikgoz/SMARTAgent-Mistral-7B-Instruct-v0.3
7B
•
Updated
•
2
•
1

emrecanacikgoz/SMARTAgent-Llama-3.1-70B
71B
•
Updated
•
5

emrecanacikgoz/SMARTAgent-Llama-3.1-8B
8B
•
Updated
•
2
•
1