CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-4k-demo_with_reasoning_v2_orchard 8B • Updated about 17 hours ago • 3
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v2_orchard Updated about 17 hours ago
CohenQu/Qwen2.5-ARC-AGI-4-8-10_3x128_shuffled_tb_32_bs_512_minibs_32_microbs_16_n_16_tp_0.6 3B • Updated 4 days ago • 12
CohenQu/Qwen3-1.7B-ARC-AGI-4-8-10_tb_64_bs_256_minibs_16_microbs_16_n_16 2B • Updated 9 days ago • 19
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-all-4k-with_reasoning Text Generation • 1B • Updated 12 days ago • 35
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_with_reasoning_fixed_DSAI 8B • Updated 14 days ago • 29
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_without_reasoning_fixed_DSAI 8B • Updated 14 days ago • 30
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_256_minibs_16_microbs_16_n_16 2B • Updated 15 days ago • 12
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_512_minibs_16_microbs_16_n_32 2B • Updated 15 days ago • 23
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-all-4k-without_reasoning Text Generation • 1B • Updated 16 days ago • 31
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_without_reasoning_fixed_DSAI Feature Extraction • 8B • Updated 17 days ago • 40
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_without_reasoning_DSAI Feature Extraction • 8B • Updated 17 days ago • 71
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_fixed_DSAI Feature Extraction • 8B • Updated 17 days ago • 59
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_DSAI Feature Extraction • 8B • Updated 17 days ago • 70
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_128_minibs_32_microbs_32_n_4 2B • Updated 17 days ago • 14
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_128_minibs_16_microbs_16_n_8 2B • Updated 17 days ago • 15
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-math-4k-with_reasoning Text Generation • 1B • Updated 17 days ago • 88
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-math-4k-without_reasoning Text Generation • 1B • Updated 18 days ago • 73
CohenQu/Qwen2.5-3B-Instruct_Continue_vs_Terminate.05.00 Text Generation • 3B • Updated 19 days ago • 46
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.01.4_orchard Text Generation • 4B • Updated 22 days ago • 8
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.01.2_orchard Text Generation • 4B • Updated 22 days ago • 8
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.00.4_orchard Text Generation • 4B • Updated 22 days ago • 11
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.01.1_orchard Text Generation • 4B • Updated 23 days ago • 11
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.00.2_orchard Text Generation • 4B • Updated 23 days ago • 12
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.00.1_orchard Text Generation • 4B • Updated 23 days ago • 49
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.00.01_orchard Text Generation • 4B • Updated 23 days ago • 53
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.00.00_orchard Text Generation • 4B • Updated 23 days ago • 61