🐤 BERT pre-training checkpoints used for analyzing early learning dynamics in "The Subspace Chronicles" (Müller-Eberstein et al., 2023).
Max (personads)
AI & ML interests
Natural Language Processing | Representation Learning | Learning Dynamics
Recent Activity
Liked a model 22 days ago: NLPnorth/snakmodel-7b-instruct-mlx-4bit
Updated a model 22 days ago: NLPnorth/snakmodel-7b-instruct
Published a model 22 days ago: NLPnorth/snakmodel-7b-instruct-mlx-4bit