Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
12
1
Thomas Broadley
tbroadley
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 12 hours ago
metr-evals/apps-with-input-validation
new
activity
about 12 hours ago
metr-evals/apps-with-input-validation:
Unify verification scripts into verify.py, add AGENTS.md
new
activity
about 12 hours ago
metr-evals/apps-with-input-validation:
Fix trailing empty strings in 94 train strs samples
View all activity
Organizations
tbroadley
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
paper
3 months ago
Measuring AI Ability to Complete Long Tasks
Paper
•
2503.14499
•
Published
Mar 18, 2025
•
16