RefalMachine/ruadapt_qwen2.5_3B_ext_u48_instruct_v4 Text Generation • 3B • Updated Dec 31, 2024 • 1k • 28
RefalMachine/ruadapt_qwen2.5_3B_ext_u48_full_lr5e4_bs256 Text Generation • 3B • Updated Oct 15, 2024 • 3
RefalMachine/ruadapt_qwen2.5_3B_ext_u48_full_lr3e4_bs256 Text Generation • 3B • Updated Oct 14, 2024 • 5
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_unigram_32000_full_lr5e4_bs256 3B • Updated Oct 13, 2024 • 3
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_full_lr5e4_2k_bs256 3B • Updated Oct 11, 2024 • 3
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_full_lr3e4_2k_bs256 3B • Updated Oct 11, 2024 • 3
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_full_lr2e4_2k_bs256 3B • Updated Oct 11, 2024 • 3
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_unigram_32000_full_lr3e4_bs256 3B • Updated Oct 10, 2024 • 3
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_unigram_32000_full_lr2e4_bs256 3B • Updated Oct 10, 2024 • 3