CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_6 Text Generation • 2B • Updated May 28 • 3
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_4 Text Generation • 2B • Updated May 28 • 3
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_9 Text Generation • 2B • Updated May 28 • 3
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_7 Text Generation • 2B • Updated May 28 • 4
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_0 Text Generation • 2B • Updated May 28 • 3
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_shard_5 Text Generation • 0.6B • Updated May 24 • 4
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_shard_2 Text Generation • 0.6B • Updated May 24 • 3
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_shard_8 Text Generation • 0.6B • Updated May 24 • 3
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_shard_6 Text Generation • 0.6B • Updated May 24 • 4
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_12K Text Generation • 0.6B • Updated May 24 • 3
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_shard_1 Text Generation • 0.6B • Updated May 24 • 3
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_shard_9 Text Generation • 0.6B • Updated May 24 • 3
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_shard_3 Text Generation • 0.6B • Updated May 24 • 4
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_shard_4 Text Generation • 0.6B • Updated May 24 • 3