open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/Qwen/Qwen3-30B-A3B/main/lcb/results_2025-05-16T19-48-27.946079.json with huggingface_hub
03a587f
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-8B/main/gpqa/results_2025-05-16T19-36-40.520298.json with huggingface_hub
c70deba
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-8B/main/math_500/results_2025-05-16T19-28-47.917794.json with huggingface_hub
db557d6
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000070/aime24/results_2025-05-16T17-11-52.898898.json with huggingface_hub
78d3ba5
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000070/gpqa/results_2025-05-16T16-57-07.605732.json with huggingface_hub
918f4be
verified

edbeeching HF Staff committed on

Upload eval_results/Qwen/Qwen3-4B/main/lcb_v4/results_2025-05-16T16-12-54.880176.json with huggingface_hub
e2e926c
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-0.6B/main/lcb/results_2025-05-16T16-02-28.959345.json with huggingface_hub
8989761
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-1.7B/main/lcb_v4/results_2025-05-16T15-12-29.309961.json with huggingface_hub
d180e4f
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000060/aime24/results_2025-05-16T15-10-36.705353.json with huggingface_hub
3f73fdc
verified

edbeeching HF Staff committed on

Upload eval_results/Qwen/Qwen3-4B/main/aime24/results_2025-05-16T14-56-15.253110.json with huggingface_hub
3d3eb3d
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-1.7B/main/aime24/results_2025-05-16T14-35-41.893313.json with huggingface_hub
e95496e
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000060/aime24/results_2025-05-16T14-35-32.899868.json with huggingface_hub
032aa5a
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000060/gpqa/results_2025-05-16T14-33-13.378175.json with huggingface_hub
f408e34
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000690/lcb_v4/results_2025-05-16T14-19-31.877797.json with huggingface_hub
146d109
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000060/gpqa/results_2025-05-16T14-19-50.195347.json with huggingface_hub
604bda9
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000690/aime24/results_2025-05-16T13-19-37.814292.json with huggingface_hub
7c2ed45
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-30B-A3B/main/lcb_v4/results_2025-05-16T12-52-18.340172.json with huggingface_hub
92e497c
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-4B/main/gpqa/results_2025-05-16T12-45-02.021054.json with huggingface_hub
87711ea
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-4B/main/math_500/results_2025-05-16T12-40-25.028109.json with huggingface_hub
b644ad2
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000690/gpqa/results_2025-05-16T12-33-48.518860.json with huggingface_hub
1095c36
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-1.7B/main/math_500/results_2025-05-16T12-29-47.855866.json with huggingface_hub
0b9f4b8
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-1.7B/main/gpqa/results_2025-05-16T12-29-29.333121.json with huggingface_hub
9a19464
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000050/aime24/results_2025-05-16T12-12-28.187902.json with huggingface_hub
5475610
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000050/aime24/results_2025-05-16T12-11-24.242904.json with huggingface_hub
79421ab
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000050/gpqa/results_2025-05-16T11-55-44.327052.json with huggingface_hub
f5dc520
verified

edbeeching HF Staff committed on

Upload eval_results/Qwen/Qwen3-0.6B/main/lcb_v4/results_2025-05-16T11-38-28.269908.json with huggingface_hub
64e96d8
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000552/lcb_v4/results_2025-05-16T11-33-13.115531.json with huggingface_hub
3edb29f
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-0.6B/main/aime24/results_2025-05-16T11-21-54.037988.json with huggingface_hub
516f241
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-30B-A3B/main/aime24/results_2025-05-16T10-54-03.124531.json with huggingface_hub
74d2531
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-30B-A3B/main/math_500/results_2025-05-16T10-05-34.384465.json with huggingface_hub
d9099ab
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000552/gpqa/results_2025-05-16T09-44-46.613278.json with huggingface_hub
8ed90c6
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen3-30B-A3B/main/gpqa/results_2025-05-16T09-42-33.204000.json with huggingface_hub
01aad7f
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000040/aime24/results_2025-05-16T09-42-14.621208.json with huggingface_hub
7a72ac7
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000040/aime24/results_2025-05-16T09-39-17.637854.json with huggingface_hub
5ad4c1a
verified

edbeeching HF Staff committed on

Upload eval_results/Qwen/Qwen3-0.6B/main/math_500/results_2025-05-16T09-36-21.920375.json with huggingface_hub
b58b204
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000552/aime24/results_2025-05-16T09-32-29.201202.json with huggingface_hub
0454910
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000040/gpqa/results_2025-05-16T09-28-39.311107.json with huggingface_hub
9ef74c4
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000040/gpqa/results_2025-05-16T09-24-20.182820.json with huggingface_hub
d69a2b7
verified

edbeeching HF Staff committed on

Upload eval_results/Qwen/Qwen3-0.6B/main/gpqa/results_2025-05-16T09-21-45.250627.json with huggingface_hub
43a5c7d
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000414/lcb_v4/results_2025-05-16T08-35-48.540546.json with huggingface_hub
eeea1f6
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000414/aime24/results_2025-05-16T07-23-36.958035.json with huggingface_hub
cb4560c
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000030/aime24/results_2025-05-16T07-15-44.570975.json with huggingface_hub
72140c6
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000030/aime24/results_2025-05-16T07-13-49.672988.json with huggingface_hub
50650bd
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000030/gpqa/results_2025-05-16T06-59-21.876698.json with huggingface_hub
a6c0d86
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000030/gpqa/results_2025-05-16T06-56-40.738341.json with huggingface_hub
a9fea7b
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.30-step-000000414/gpqa/results_2025-05-16T06-48-23.147769.json with huggingface_hub
338d992
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000020/aime24/results_2025-05-16T06-09-30.932291.json with huggingface_hub
1cb6414
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000020/aime24/results_2025-05-16T06-08-19.905363.json with huggingface_hub
7d7184f
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v08.00-step-000000020/gpqa/results_2025-05-16T05-52-45.441892.json with huggingface_hub
27fefda
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B-Merges-GRPO/v09.00-step-000000020/gpqa/results_2025-05-16T05-52-12.466744.json with huggingface_hub
0f4a532
verified

edbeeching HF Staff committed on