Kazuki1450/Olmo-3-1025-7B_dsum_3_6_length_fp_btw_484_516_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_length_fp_btw_468_532_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_length_fp_btw_284_316_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_length_fp_btw_368_432_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_length_fp_btw_268_332_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_length_fp_btw_384_416_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_length_fp_btw_484_516_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_length_fp_btw_468_532_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_length_fp_btw_368_432_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_length_fp_btw_284_316_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_length_fp_btw_268_332_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_length_fp_btw_384_416_1p0_0p0_1p0_grpo_42_rule Updated 30 days ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_rel_1e-5_1p0_0p0_1p0_grpo_42_rule Updated about 1 month ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_rel_1e-4_1p0_0p0_1p0_grpo_42_rule Updated about 1 month ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_rel_1e-3_1p0_0p0_1p0_grpo_42_rule Updated about 1 month ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_rel_1e-2_1p0_0p0_1p0_grpo_42_rule Updated about 1 month ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_fnr_no_bracket_0p0_0p0_1p0_grpo_42_rule Updated about 1 month ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_42_rule Updated about 1 month ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_rel_1e-1_1p0_0p0_1p0_grpo_42_rule Updated about 1 month ago