Andrea Santilli

teelinsan
https://www.santilli.xyz/

AI & ML interests

Natural Language Processing

Organizations

BigScience Workshop, Gladia Research Group, Risorse per la Lingua Italiana, BigCode, Talking LLMs, Hugging Face Discord Community

upvoted a paper 2 months ago

Attention Sinks in Diffusion Language Models

Paper • 2510.15731 • Published Oct 17, 2025 • 48
upvoted a paper 3 months ago

Language Models are Injective and Hence Invertible

Paper • 2510.15511 • Published Oct 17, 2025 • 69
upvoted a paper 8 months ago

Mergenetic: a Simple Evolutionary Model Merging Library

Paper • 2505.11427 • Published May 16, 2025 • 14
upvoted a collection 9 months ago

Hermes 3

Collection
The Hermes 3 Series of Models • 11 items • Updated Sep 8, 2025 • 132
upvoted a collection 10 months ago

DeepHermes

Collection
Preview models of hybrid reasoner Hermes series • 6 items • Updated Sep 8, 2025 • 41
upvoted a paper almost 2 years ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189
upvoted a paper over 2 years ago

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Paper • 2304.01373 • Published Apr 3, 2023 • 9