Santiago Garcia's picture

Santiago Garcia

santyzenith

·

AI & ML interests

Large language models, Natural Language Processing, Computer Vision, Spanish Large language models.

Recent Activity

liked a model 26 days ago

moonshotai/Kimi-K2-Instruct

liked a model about 1 month ago

google/t5gemma-2b-2b-prefixlm

liked a model about 1 month ago

google/gemma-3n-E4B-it

View all activity

Organizations

upvoted an article 2 months ago

Article

A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality

By

and 3 others •

Oct 24, 2024

• 62

upvoted an article 3 months ago

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

May 12

• 500

upvoted a collection 6 months ago

DeepSeek-VL2

5 items • Updated Feb 9 • 75

upvoted a collection 7 months ago

BGE

30 items • Updated May 20 • 127

upvoted a collection 8 months ago

RLHF

A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated 18 days ago • 5

upvoted a collection 10 months ago

LLM2Vec

16 items • Updated Oct 8, 2024 • 45

upvoted 3 articles 11 months ago

Article

Train a Llama model from scratch

By

•

Jul 29, 2024

• 52

Article

Vision Language Models Explained

By

and 1 other •

Apr 11, 2024

• 429

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

By

and 5 others •

Mar 9, 2023

• 60

upvoted 2 papers 12 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 40

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Paper • 2302.13750 • Published Feb 27, 2023 • 2

upvoted 3 articles 12 months ago

Article

Introduction to Graph Machine Learning

By

•

Jan 3, 2023

• 40

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

By

and 7 others •

Jul 23, 2024

• 237

Article

Welcome Gemma 2 - Google's new open LLM

By

and 5 others •

Jun 27, 2024

• 130

upvoted a paper about 1 year ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 54

upvoted 2 articles about 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

By

and 2 others •

Jul 16, 2024

• 404

Article

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

By

•

Oct 21, 2022

• 34

upvoted a paper about 1 year ago

Tuna: Instruction Tuning using Feedback from Large Language Models

Paper • 2310.13385 • Published Oct 20, 2023 • 10

upvoted a collection about 1 year ago

Knowledge distillation

88 items • Updated Feb 7, 2024 • 7

upvoted an article about 1 year ago

Article

Putting RL back in RLHF

By

and 1 other •

Jun 12, 2024

• 99