ProgramBench
ProgramBench: Can Language Models Rebuild Programs From Scratch?
Given only a compiled binary and its documentation, AI agents must architect and implement a complete codebase that reproduces the original program's behavior. ProgramBench evaluates this capability across 200 real-world open-source projects spanning Rust, Go, C, C++, Haskell, and Java.
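One way to picture "reproduces the original program's behavior" is a differential check: run the original binary and the rebuilt program on the same inputs and compare what comes out. The sketch below is only an illustration under that assumption. It is not ProgramBench's actual evaluation harness, and the names `run_program` and `behavior_matches`, as well as the choice to compare exit code and stdout only, are hypothetical.

```python
import subprocess
import sys
from typing import List, Sequence, Tuple


def run_program(cmd: Sequence[str], stdin_text: str, timeout: float = 10.0) -> Tuple[int, str]:
    """Run a command with the given stdin and return (exit code, stdout)."""
    proc = subprocess.run(
        list(cmd),
        input=stdin_text,
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    return proc.returncode, proc.stdout


def behavior_matches(
    reference_cmd: Sequence[str],
    candidate_cmd: Sequence[str],
    stdin_cases: Sequence[str],
) -> List[bool]:
    """Return one boolean per input case: True when the candidate's
    exit code and stdout equal the reference's for that case.

    Assumption: observable behavior is exit code plus stdout. A real
    harness might also compare stderr, files written, or timing."""
    return [
        run_program(reference_cmd, case) == run_program(candidate_cmd, case)
        for case in stdin_cases
    ]


if __name__ == "__main__":
    # Hypothetical stand-ins: two tiny Python one-liners play the roles of
    # the original binary and the reimplementation.
    reference = [sys.executable, "-c", "import sys; print(sys.stdin.read().upper(), end='')"]
    candidate = [sys.executable, "-c", "import sys; sys.stdout.write(sys.stdin.read().upper())"]
    print(behavior_matches(reference, candidate, ["hello\n", "abc"]))
```

In this sketch the two commands are interchangeable lists of arguments, so the same function could compare a compiled reference against a build of the regenerated codebase without knowing which language either was written in.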