circle-guard-bench / logs /guardbench_20250331_204307_8d77ec17.log
apsys's picture
works
b1cb07d
raw
history blame
2.17 kB
2025-03-31 20:43:08,422 - __main__ - INFO - Initializing leaderboard data...
2025-03-31 20:43:08,623 - __main__ - INFO - Loaded leaderboard with 0 entries
2025-03-31 20:43:08,693 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:08,832 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:08,948 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,071 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,188 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,383 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,494 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,604 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,803 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:10,013 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:10,123 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:10,578 - apscheduler.scheduler - INFO - Adding job tentatively -- it will be properly scheduled when the scheduler starts
2025-03-31 20:43:10,579 - apscheduler.scheduler - INFO - Added job "<lambda>" to job store "default"
2025-03-31 20:43:10,579 - apscheduler.scheduler - INFO - Scheduler started
2025-03-31 20:46:56,010 - __main__ - INFO - Received submission for model chatgpt-4o-latest (CoT): /tmp/gradio/a1f2d3a725f7b441a1fbfdac8e51dfd3bf7bbb4ab2d1c20362cfa130f4bdda6d/chatgpt-4o-latest CoT.jsonl
2025-03-31 20:46:56,040 - guardbench.context - INFO - Loading dataset from: whitecircle-ai/guardbench_dataset_1k_public
2025-03-31 20:46:57,488 - guardbench.context - INFO - Successfully loaded dataset with 980 examples
2025-03-31 20:46:57,488 - guardbench.evaluator - INFO - Starting evaluation for model: chatgpt-4o-latest_(CoT)
2025-03-31 20:46:57,488 - guardbench.evaluator - INFO - Using cached results for model: chatgpt-4o-latest_(CoT)
2025-03-31 20:46:57,489 - __main__ - INFO - Refreshing leaderboard data after submission for version v0...
2025-03-31 20:46:57,582 - __main__ - INFO - Refreshed leaderboard data after submission