Substance Beats Style: Why Beginning Students Fail to Code with LLMs Paper • 2410.19792 • Published Oct 15, 2024
AgentPack: A Dataset of Code Changes, Co-Authored by Agents and Humans Paper • 2509.21891 • Published Sep 26
StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code Paper • 2306.04556 • Published Jun 7, 2023
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation Paper • 2208.08227 • Published Aug 17, 2022 • 1
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3 • 9
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3 • 9
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3 • 9
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3 • 9
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3 • 9
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3 • 9
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published Feb 3 • 9