yanhong-li/qwen2_3b_instruct_gdn_v4_hybrid_30attn_trained_s2_gdnv4_num_layer_35_selection Updated 10 days ago
yanhong-li/llama3_3b_gdn_v4_hybrid_22attn_trained_s2_gdnv4_num_layer_35_selection Updated 10 days ago
yanhong-li/llama3_3b_gdn_v4_hybrid_23attn_trained_s2_gdnv4_num_layer_35_selection Updated 10 days ago