--- license: apache-2.0 base_model: - meta-llama/Llama-3.1-8B-Instruct tags: - reasoning - agent - program - code --- **CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis** Paper: https://arxiv.org/pdf/2503.23145 Code: https://github.com/Anjiang-Wei/CodeARC Website: https://anjiang-wei.github.io/CodeARC-Website/ ``` @article{wei2025codearc, title={CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis}, author={Wei, Anjiang and Suresh, Tarun and Cao, Jiannan and Kannan, Naveen and Wu, Yuheng and Yan, Kai and Teixeira, Thiago SFX and Wang, Ke and Aiken, Alex}, journal={arXiv preprint arXiv:2503.23145}, year={2025} } ```