DynMoE model checkpoints and paper on Hugging Face
- LINs-lab/DynMoE-StableLM-1.6B
  Text Generation • 3B • Updated • 10 • 2
- LINs-lab/DynMoE-Qwen-1.8B
  Text Generation • 3B • Updated • 15 • 2
- LINs-lab/DynMoE-Phi-2-2.7B
  Text Generation • 6B • Updated • 6 • 4
- Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
  Paper • 2405.14297 • Published • 3