Spaces: TeddyYao/grok4-gpqa-eval (Running)

grok4-gpqa-eval / benchmarks
  • 1 contributor
History: 1 commit
Latest commit: 8474f02 (verified) by TeddyYao, "Upload 38 files", 22 days ago
  • __pycache__ (directory), Upload 38 files, 22 days ago
  • __init__.py (740 Bytes), Upload 38 files, 22 days ago
  • base_benchmark.py (4.67 kB), Upload 38 files, 22 days ago
  • evaluation_utils.py (4.93 kB), Upload 38 files, 22 days ago
  • gpqa_benchmark.py (4.76 kB), Upload 38 files, 22 days ago
  • gsm8k_benchmark.py (4.52 kB), Upload 38 files, 22 days ago
  • humaneval_benchmark.py (4.85 kB), Upload 38 files, 22 days ago
  • math_benchmark.py (4.68 kB), Upload 38 files, 22 days ago
  • mmlu_benchmark.py (5.61 kB), Upload 38 files, 22 days ago
  • prompt_templates.py (4.08 kB), Upload 38 files, 22 days ago