On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Paper • 2306.13649 • Published Jun 23, 2023 • 28
FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only Paper • 2408.01323 • Published Aug 2, 2024 • 1