dots.llm1.base 🪐 a 142B MoE model with only 14B active params.
rednote-hilab/dotsllm1-68246aaaaba3363374a8aa7c ✨ Base & Instruct - MIT license ✨ Trained on 11.2T non-synthetic high-quality data ✨ Competitive with Qwen2.5/3 on reasoning, code, alignment