CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_120K Text Generation • 2B • Updated May 28 • 14
CodeAtCMU/Qwen3-0.6B-Base_full_sft_natural_language_data_120K Text Generation • 0.6B • Updated May 28 • 17
CodeAtCMU/gemma-3-4b-pt_full_sft_natural_language_data_120K Image-Text-to-Text • 4B • Updated May 28 • 3
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_3 Text Generation • 2B • Updated May 28 • 14
CodeAtCMU/Qwen3-4B-Base_full_sft_natural_language_data_120K Text Generation • 4B • Updated May 28 • 14
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_2 Text Generation • 2B • Updated May 28 • 3
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_8 Text Generation • 2B • Updated May 28 • 3
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_5 Text Generation • 2B • Updated May 28 • 3
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_12K Text Generation • 2B • Updated May 28 • 3
CodeAtCMU/Qwen3-1.7B-Base_full_sft_natural_language_data_shard_1 Text Generation • 2B • Updated May 28 • 4