task,metric,value,err,version
anli_r1,acc,0.329,0.014865395385928364,0
anli_r2,acc,0.323,0.01479492784334864,0
anli_r3,acc,0.3425,0.013704669762934725,0
arc_challenge,acc,0.3310580204778157,0.01375206241981784,0
arc_challenge,acc_norm,0.3575085324232082,0.014005494275916573,0
arc_easy,acc,0.6632996632996633,0.009697166595752472,0
arc_easy,acc_norm,0.6397306397306397,0.00985100258473238,0
boolq,acc,0.6412844036697247,0.0083886680340594,1
cb,acc,0.5178571428571429,0.06737697508644648,1
cb,f1,0.36209571547917413,,1
copa,acc,0.8,0.040201512610368445,0
hellaswag,acc,0.4794861581358295,0.0049855800659464565,0
hellaswag,acc_norm,0.6293567018522207,0.00481989994534249,0
piqa,acc,0.7388465723612623,0.010248738649935578,0
piqa,acc_norm,0.7301414581066377,0.010356595421852195,0
rte,acc,0.5703971119133574,0.02979666882912467,0
sciq,acc,0.916,0.008776162089491115,0
sciq,acc_norm,0.899,0.009533618929340987,0
storycloze_2016,acc,0.7295563869588455,0.010271810373331027,0
winogrande,acc,0.6148382004735596,0.013676821287521412,0