AI & ML interests
None defined yet.
Recent Activity
models 45
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step300
4B • Updated
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step250
4B • Updated
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advprm-n5-eta200-stepLen256-stepSplit-length-step400
4B • Updated
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advorm-n5-eta200-stepLen256-stepSplit-length-step250
4B • Updated
PRM-CoT/Llama-3.2-1B-Instruct-numina-grpo-prm_advorm-n5-eta200-stepLen256-stepSplit-length-step250
1B • Updated
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advorm-n5-eta100-stepLen256-stepSplit-nn-step200
4B • Updated
PRM-CoT/Llama-3.2-3B-Instruct-numina-grpo-prm_advorm-n5-eta0-stepLen0-stepSplit-nn-step200
4B • Updated
PRM-CoT/Llama-3.2-1B-Instruct-numina-grpo-prm_advorm-n5-eta0-stepLen0-stepSplit-nn-step500
1B • Updated
PRM-CoT/Llama-3.2-1B-Instruct-numina-grpo-prm_advorm-n5-eta100-stepLen256-stepSplit-nn-step500
1B • Updated
PRM-CoT/Llama-3.2-1B-Instruct-numina-grpo-prm_advorm-n5-eta100-stepLen256-stepSplit-nn-step400
1B • Updated