DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_100935 Updated 23 minutes ago
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-claude-haiku-4-5-20251001-20260114_100503 Updated 37 minutes ago
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-nano-2025-08-07-20260114_083247 Updated 42 minutes ago
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_063530 Viewer • Updated about 4 hours ago • 297
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc8128d14e Viewer • Updated about 7 hours ago • 320
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc86d58830 Viewer • Updated about 8 hours ago • 300
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc681260e0 Viewer • Updated about 13 hours ago • 228
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocaae8961a Viewer • Updated about 13 hours ago • 214
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_lr_1e-5_Qwen307d4a0be Viewer • Updated 1 day ago • 216
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocd012f2db Viewer • Updated 3 days ago • 315 • 7
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocbf0cdce5 Viewer • Updated 3 days ago • 216 • 11
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocabfcbbc3 Viewer • Updated 4 days ago • 331 • 11