arxiv:2407.18418
Wen
Byanka
models 30
Byanka/vgrpo-hotpot1_1.5B-Instruct • Text Generation • 2B • Updated • 3
Byanka/confgrpo-hotpot_1-1.5B-Instruct_new • Updated
Byanka/RLVR-hotpot1_1.5B-Instruct • Text Generation • 2B • Updated • 2
Byanka/RLVR-hotpot_3b • Text Generation • 3B • Updated • 2
Byanka/RLCR-hotpot_1-1.5B-Instruct • Text Generation • 2B • Updated • 3
Byanka/RLVR-hotpot1_1.5B • Text Generation • 2B • Updated • 2
Byanka/RLCR-hotpot_1-1.5B • Text Generation • 2B • Updated • 2
Byanka/RLCR-hotpot_3b • Updated
Byanka/RLCR-hotpot • Text Generation • 8B • Updated • 1
Byanka/RLVR-hotpot • Text Generation • 8B • Updated • 1