Agent-Eval-Refine/README
Jiayi-Pan committed on Apr 8, 2024

Commit 56533db (verified) · 1 Parent(s): 8451c9e

Update README.md

Files changed (1): README.md

@@ -11,7 +11,7 @@ pinned: false
 ### [Paper - Stay tuned for the Tuesday release!]() | [Code](https://github.com/Berkeley-NLP/Agent-Eval-Refine)
-We explore the design and use of model-based evaluators to both evaluate and autonomously refine the performance of digital agents. Experiments show that domain-general automated evaluators can significantly improve the performance of digital agents, without any extra supervision.
+We design and use model-based evaluators to both evaluate and autonomously refine the performance of digital agents. Experiments show that domain-general automated evaluators can significantly improve the performance of digital agents, without any extra supervision.
 [Jiayi Pan](https://www.jiayipan.me/), [Yichi Zhang](https://sled.eecs.umich.edu/author/yichi-zhang/), [Nicholas Tomlin](https://people.eecs.berkeley.edu/~nicholas_tomlin/), [Yifei Zhou](https://yifeizhou02.github.io/), [Sergey Levine](https://people.eecs.berkeley.edu/~svlevine/), [Alane Suhr](https://www.alanesuhr.com/)