arxiv:2412.14161
Wayne Chi
waynechi
·
AI & ML interests
None yet
Recent Activity
updated a dataset 3 days ago
copilot-arena/editbench updated a dataset 3 days ago
copilot-arena/editbench submitted a paper 2 months ago
GameDevBench: Evaluating Agentic Capabilities Through Game Development