Hugging Face
BenchHub
non-profit
Recent Activity
amphora submitted a paper 4 days ago
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs
EunsuKim updated a dataset 9 days ago
BenchHub/BenchHub-Ko
amphora submitted a paper 3 months ago
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math
Team members: 3
BenchHub's Spaces: 1
📊 BenchHub (Sleeping): Customize and evaluate LLMs using BenchHub