2025-03-31 20:43:08,422 - __main__ - INFO - Initializing leaderboard data...
2025-03-31 20:43:08,623 - __main__ - INFO - Loaded leaderboard with 0 entries
2025-03-31 20:43:08,693 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:08,832 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:08,948 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,071 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,188 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,383 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,494 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,604 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:09,803 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:10,013 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:10,123 - __main__ - WARNING - Initializing empty leaderboard
2025-03-31 20:43:10,578 - apscheduler.scheduler - INFO - Adding job tentatively -- it will be properly scheduled when the scheduler starts
2025-03-31 20:43:10,579 - apscheduler.scheduler - INFO - Added job "" to job store "default"
2025-03-31 20:43:10,579 - apscheduler.scheduler - INFO - Scheduler started
2025-03-31 20:46:56,010 - __main__ - INFO - Received submission for model chatgpt-4o-latest (CoT): /tmp/gradio/a1f2d3a725f7b441a1fbfdac8e51dfd3bf7bbb4ab2d1c20362cfa130f4bdda6d/chatgpt-4o-latest CoT.jsonl
2025-03-31 20:46:56,040 - guardbench.context - INFO - Loading dataset from: whitecircle-ai/guardbench_dataset_1k_public
2025-03-31 20:46:57,488 - guardbench.context - INFO - Successfully loaded dataset with 980 examples
2025-03-31 20:46:57,488 - guardbench.evaluator - INFO - Starting evaluation for model: chatgpt-4o-latest_(CoT)
2025-03-31 20:46:57,488 - guardbench.evaluator - INFO - Using cached results for model: chatgpt-4o-latest_(CoT)
2025-03-31 20:46:57,489 - __main__ - INFO - Refreshing leaderboard data after submission for version v0...
2025-03-31 20:46:57,582 - __main__ - INFO - Refreshed leaderboard data after submission