suchirsalhan/babybabellm-multi-all

suchirsalhan committed on Sep 14

Commit 639635b (verified) · 1 Parent(s): 2e8ea9f

Update README.md

Files changed (1)

README.md +4 -4

@@ -8,18 +8,18 @@ license: mit
 ---
 # babybabellm-multismall
-This repository contains checkpoints for the **multismall** variant of **BabyBabeLLM**.
 ## Files
 - `*_15_16.bin` – main model weights
 - `*_15_16_ema.bin` – EMA smoothed weights
 - `*_15_16_state_dict.bin` – PyTorch state dict
 - `pytorch_model.bin` – extracted EMA weights (for AutoModel)
-- Config + tokenizer files for model loading (zipped in shared_files.zip)
 ## Usage
 ```python
 from transformers import AutoModel, AutoTokenizer
-repo = "suchirsalhan/babybabellm-multismall"
 tokenizer = AutoTokenizer.from_pretrained(repo)
 model = AutoModel.from_pretrained(repo)
 inputs = tokenizer("Hello world!", return_tensors="pt")
@@ -27,5 +27,5 @@ outputs = model(**inputs)
 ```
 ## Notes
 - These are research checkpoints trained on BabyLM-style data.
-- Model naming: `multismall` indicates the language/config variant.

 ---
 # babybabellm-multismall
+This repository contains checkpoints for the **multilingual (all)** variant of **BabyBabeLLM**.
 ## Files
 - `*_15_16.bin` – main model weights
 - `*_15_16_ema.bin` – EMA smoothed weights
 - `*_15_16_state_dict.bin` – PyTorch state dict
 - `pytorch_model.bin` – extracted EMA weights (for AutoModel)
 ## Usage
 ```python
 from transformers import AutoModel, AutoTokenizer
+repo = "suchirsalhan/babybabellm-multi-all"
 tokenizer = AutoTokenizer.from_pretrained(repo)
 model = AutoModel.from_pretrained(repo)
 inputs = tokenizer("Hello world!", return_tensors="pt")
 ```
 ## Notes
 - These are research checkpoints trained on BabyLM-style data.
+- Model naming: `multiall` indicates the language/config variant.
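
The README's file list mentions "EMA smoothed weights" without saying how they relate to the main weights. As a rough illustration only, the sketch below shows the generic exponential-moving-average update that the term usually refers to. The function name `ema_update`, the toy weight dictionaries, and the `decay` value are all assumptions for the example; nothing here is taken from the BabyBabeLLM training code.

```python
def ema_update(ema_weights, model_weights, decay=0.999):
    """Return new EMA weights after one step (hypothetical helper).

    Each EMA value moves a fraction (1 - decay) toward the current
    model value, so the EMA copy changes more smoothly than the raw weights.
    """
    return {
        name: decay * ema_weights[name] + (1.0 - decay) * model_weights[name]
        for name in model_weights
    }


# Toy example with a single scalar "weight"; the decay is deliberately
# small here so the effect is visible after a few steps.
ema = {"w": 0.0}
for step_weights in ({"w": 1.0}, {"w": 1.0}, {"w": 1.0}):
    ema = ema_update(ema, step_weights, decay=0.5)

print(ema["w"])  # 0.875
```

In practice the same update would be applied per tensor rather than per float, and typical decay values are much closer to 1 than the 0.5 used in this toy run.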