Muennighoff's picture
Merge
c22587e
task,metric,value,err,version
anli_r1,acc,0.339,0.014976758771620347,0
anli_r2,acc,0.313,0.014671272822977886,0
anli_r3,acc,0.3425,0.013704669762934727,0
arc_challenge,acc,0.3122866894197952,0.013542598541688065,0
arc_challenge,acc_norm,0.3412969283276451,0.01385583128749772,0
arc_easy,acc,0.6439393939393939,0.009825454608416316,0
arc_easy,acc_norm,0.6035353535353535,0.010037412763064529,0
boolq,acc,0.6131498470948012,0.008518188340844762,1
cb,acc,0.5,0.06741998624632421,1
cb,f1,0.35220125786163514,,1
copa,acc,0.8,0.040201512610368445,0
hellaswag,acc,0.49502091216889066,0.004989533998820352,0
hellaswag,acc_norm,0.6565425214100776,0.0047389206247244785,0
piqa,acc,0.766050054406964,0.009877236895137455,0
piqa,acc_norm,0.7622415669205659,0.009932525779525492,0
rte,acc,0.5451263537906137,0.029973636495415252,0
sciq,acc,0.909,0.009099549538400236,0
sciq,acc_norm,0.89,0.009899393819724442,0
storycloze_2016,acc,0.7354355959380011,0.010200400541714168,0
winogrande,acc,0.6053670086819258,0.013736915172371885,0