Commit History
Upload from nightly evaluation run
c3be561
verified
Upload from nightly evaluation run
717a6f5
verified
Upload from nightly evaluation run
d1d78d1
verified
Upload from nightly evaluation run
5c42feb
verified
Upload from nightly evaluation run
db2b129
verified
Upload from nightly evaluation run
2209216
verified
Upload from nightly evaluation run
803054c
verified
Upload from nightly evaluation run
1f5f72f
verified
Upload from nightly evaluation run
7bb626a
verified
Upload from nightly evaluation run
ca0992e
verified
Upload from GitHub Actions: Add math benchmarks
549360a
verified
Upload from GitHub Actions: More results
52abc5b
verified
Upload from nightly evaluation run
4a34e67
verified
Upload from GitHub Actions: Update model ranking fetching
f840423
verified
Upload from GitHub Actions: Use FLORES+ via Huggingface
913253a
verified
Upload from nightly evaluation run
7fce0be
verified
Upload from nightly evaluation run
7e8d13c
verified
Upload from GitHub Actions: Quick fixes
9c2c019
verified
Upload from nightly evaluation run
9ee89ef
verified
Upload from nightly evaluation run
3c24b37
verified
Upload from nightly evaluation run
a893400
verified
Upload from nightly evaluation run
124b16a
verified
Upload from nightly evaluation run
3270126
verified
Upload from nightly evaluation run
8a4050a
verified
Upload from GitHub Actions: More models
0bd935e
verified
Upload from nightly evaluation run
1d4c8a4
verified
Upload from GitHub Actions: Increase n_models
d09b095
verified
Upload from GitHub Actions: New results
b311dd5
verified
Upload from GitHub Actions: Merge pull request #4 from datenlabor-bmz/jonas-dev
7c6a118
verified
Upload from nightly evaluation run
47bcf10
verified
Upload from GitHub Actions: Fix vibecoding
75010c2
verified
Upload from GitHub Actions: Ugly fix for CI errors
adc94d7
verified
Upload from GitHub Actions: Use Python 3.12 in all environments (uv, Docker, CI) and upgrade packages
a61d2b3
verified
Upload from GitHub Actions: Try moving `cache` calls that cause CI issues
bc4afa0
verified
Upload from GitHub Actions: Exclude free models from evals
c9e9db6
verified
Upload from GitHub Actions: Change HF repo URL
3409596
verified
Upload from nightly evaluation run
dcb356d
verified
Upload from nightly evaluation run
d2c1cb4
verified
Upload from GitHub Actions: Display N/A scores as such
1e8952a
verified
Upload from GitHub Actions: Copy data files to Docker image
6f68367
verified
Only upload relevant files
b258643
David Pomerenke
Use uv for pushing to HF
c144fd8
David Pomerenke
Update evaluation results [skip ci]
53c941c
github-actions[bot]
Try unsetting stuff
006f88d
David Pomerenke
Fix pushing
ce9de0c
David Pomerenke
Use GH_PAT
9851df9
David Pomerenke
Use GITHUB_TOKEN
6dd85c2
David Pomerenke
Block gemini-2.5-pro-exp-03-25
092c06a
David Pomerenke
Pass through kwargs
5fa433f
David Pomerenke