BrachioLab (Brachio Lab)

davisrbr

authored a paper 2 months ago

BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks

Paper • 2510.02418 • Published Oct 2 • 2

davisrbr

updated a collection 3 months ago

Adaptive Evaluations

Collection

Datasets for our paper, "Adaptively profiling models with task elicitation" (EMNLP 2025). • 1 item • Updated Sep 20

davisrbr

published 2 datasets 3 months ago

BrachioLab/legal_generated_questions

Viewer • Updated May 22 • 10.4k • 5

BrachioLab/politeness_generated_questions

Viewer • Updated May 22 • 9.8k • 6

davisrbr

updated 3 datasets 7 months ago

davisrbr

published a dataset 8 months ago

BrachioLab/BSD

Viewer • Updated May 28 • 1 • 11 • 4

helenjin

updated a dataset 8 months ago

BrachioLab/mcmed-cardiac

Viewer • Updated May 12 • 388 • 42

helenjin

published a dataset 8 months ago

BrachioLab/mcmed-cardiac

Viewer • Updated May 12 • 388 • 42

helenjin

updated a dataset 8 months ago

BrachioLab/mcmed-cardiac-full

Viewer • Updated May 8 • 500 • 114

helenjin

published a dataset 8 months ago

BrachioLab/mcmed-cardiac-full

Viewer • Updated May 8 • 500 • 114

helenjin

updated a dataset 8 months ago

BrachioLab/mcmed-cardiac-example

Viewer • Updated May 4 • 8 • 10

helenjin

published a dataset 8 months ago

BrachioLab/mcmed-cardiac-example

Viewer • Updated May 4 • 8 • 10

chaenykim

updated a dataset 8 months ago

BrachioLab/mcmed-sepsis

Viewer • Updated May 3 • 3.25k • 21

chaenykim

published a dataset 8 months ago

BrachioLab/mcmed-sepsis

Viewer • Updated May 3 • 3.25k • 21

fallcat

updated a model 10 months ago

BrachioLab/multirc_vanilla

Updated Feb 26 • 9

fallcat

published a model 10 months ago

BrachioLab/multirc_vanilla

Updated Feb 26 • 9

fallcat

updated 2 datasets about 1 year ago

BrachioLab/massmaps-cosmogrid-100k

Viewer • Updated Nov 5, 2024 • 110k • 53

BrachioLab/cholec

Viewer • Updated Nov 5, 2024 • 1.02k • 35

Team members 6
