|
--- |
|
license: apache-2.0 |
|
base_model: |
|
- meta-llama/Llama-3.1-8B-Instruct |
|
tags: |
|
- reasoning |
|
- agent |
|
- program |
|
- code |
|
--- |
|
**CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis** |
|
|
|
|
|
Paper: https://arxiv.org/pdf/2503.23145 |
|
|
|
Code: https://github.com/Anjiang-Wei/CodeARC |
|
|
|
Website: https://anjiang-wei.github.io/CodeARC-Website/ |
|
|
|
``` |
|
@article{wei2025codearc, |
|
title={CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis}, |
|
author={Wei, Anjiang and Suresh, Tarun and Cao, Jiannan and Kannan, Naveen and Wu, Yuheng and Yan, Kai and Teixeira, Thiago SFX and Wang, Ke and Aiken, Alex}, |
|
journal={arXiv preprint arXiv:2503.23145}, |
|
year={2025} |
|
} |
|
``` |