Minitron Collection A family of compressed models obtained via pruning and knowledge distillation โข 12 items โข Updated 2 days ago โข 62
Better & Faster Large Language Models via Multi-token Prediction Paper โข 2404.19737 โข Published Apr 30, 2024 โข 81