simonycl/octothinker-8b-hybrid-zero-cold-start-sft-step-5 Text Generation • 8B • Updated 15 days ago • 366
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-462 Text Generation • 3B • Updated 21 days ago • 14
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-400 Text Generation • 3B • Updated 21 days ago • 18
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-300 Text Generation • 3B • Updated 21 days ago • 18
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-200 Text Generation • 3B • Updated 21 days ago • 18
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-100 Text Generation • 3B • Updated 21 days ago • 20
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_60k_win_only Text Generation • 4B • Updated Aug 5 • 7
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_60k_whole Text Generation • 4B • Updated Aug 5 • 7
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_30k_win_only Text Generation • 4B • Updated Aug 5 • 6
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_30k_whole Text Generation • 4B • Updated Aug 5 • 7
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_15k_win_only Text Generation • 4B • Updated Aug 5 • 7
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_5k_win_only Text Generation • 4B • Updated Aug 5 • 5
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_5k_whole Text Generation • 4B • Updated Aug 5 • 6
simonycl/octothinker-3b-hybrid-zero-cold-start-step-5 Text Generation • 3B • Updated Jul 23 • 241