task,metric,value,err,version
anli_r1,acc,0.335,0.014933117490932575,0
anli_r2,acc,0.326,0.014830507204541042,0
anli_r3,acc,0.33666666666666667,0.013647602942406401,0
arc_challenge,acc,0.32764505119453924,0.013715847940719346,0
arc_challenge,acc_norm,0.3728668941979522,0.014131176760131163,0
arc_easy,acc,0.6788720538720538,0.009580787536986797,0
arc_easy,acc_norm,0.6590909090909091,0.009726579593424019,0
boolq,acc,0.652599388379205,0.00832781675259947,1
cb,acc,0.4642857142857143,0.06724777654937658,1
cb,f1,0.3026891807379612,,1
copa,acc,0.83,0.037752516806863715,0
hellaswag,acc,0.4787890858394742,0.004985289555586538,0
hellaswag,acc_norm,0.6365265883290181,0.004800164434233263,0
piqa,acc,0.73449401523395,0.010303308653024427,0
piqa,acc_norm,0.7383025027203483,0.010255630772708227,0
rte,acc,0.5342960288808665,0.030025579819366426,0
sciq,acc,0.927,0.008230354715244054,0
sciq,acc_norm,0.921,0.008534156773333435,0
storycloze_2016,acc,0.7471940138963121,0.010050543909878572,0
winogrande,acc,0.6179952644041041,0.013655578215970418,0