Yifan Mai's picture

3

Yifan Mai

yifanmai

·

yifanmai

AI & ML interests

None yet

Recent Activity

new activity 2 days ago

evaleval/EEE_datastore:Add HELM Safety v1.17.0 results

authored a paper 5 days ago

VHELM: A Holistic Evaluation of Vision Language Models

authored a paper 5 days ago

AIR-Bench 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies

View all activity

Organizations

Papers 12

arxiv:2511.20836

arxiv:2510.11977

arxiv:2508.21376

arxiv:2505.21972

models 0

None public yet

datasets 3

yifanmai/arabic-enterprise

Viewer • Updated 17 days ago • 721 • 26

yifanmai/czech_bank_qa

Viewer • Updated Dec 19, 2024 • 132 • 695

yifanmai/call-center

Viewer • Updated Aug 28, 2024 • 725 • 3 • 4