view article Article A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality By johndang-cohere and 3 others • Oct 24, 2024 • 62
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 500
RLHF Collection A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated 18 days ago • 5
view article Article Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU By edbeeching and 5 others • Mar 9, 2023 • 60
Compact Language Models via Pruning and Knowledge Distillation Paper • 2407.14679 • Published Jul 19, 2024 • 40
MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition Paper • 2302.13750 • Published Feb 27, 2023 • 2
view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context By philschmid and 7 others • Jul 23, 2024 • 237
view article Article Welcome Gemma 2 - Google's new open LLM By philschmid and 5 others • Jun 27, 2024 • 130
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 54
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 404
view article Article From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease By muellerzr • Oct 21, 2022 • 34
Tuna: Instruction Tuning using Feedback from Large Language Models Paper • 2310.13385 • Published Oct 20, 2023 • 10