arxiv:2604.09687
Diji Yang
dyang39
AI & ML interests
None yet
Recent Activity
authored a paper 2 days ago
VULCA-Bench: A Multicultural Vision-Language Benchmark for Evaluating Cultural Understanding authored a paper 2 days ago
Shared Nature, Unique Nurture: PRISM for Pluralistic Reasoning via In-context Structure Modeling authored a paper 2 days ago
Classroom Final Exam: An Instructor-Tested Reasoning Benchmark