AuriStreamParallel100M_Group4_BigAudioDataset_180k

AuriStream Parallel is a discrete diffusion speech language model by Greta Tuckute and Klemen Kotar.

Model Details

Parameter	Value
Parameters	~0.12B
Layers	12
Hidden Size	768
Attention Heads	12
Vocab Size	8193
Group Size	4
Mask Schedule	linear_text_prime

Architecture

Bidirectional transformer attention
Grouped token latent projection
Parallel token heads for group-wise prediction
Partial masking diffusion objective

Usage

from transformers import AutoModel

model = AutoModel.from_pretrained(
    "TuKoResearch/AuriStreamParallel100M_Group4_BigAudioDataset_180k",
    trust_remote_code=True,
)

Base Model Code

This checkpoint uses shared model code from TuKoResearch/AuriStreamParallel-base.

Tokenizer

This model is intended for cochlear tokens, e.g. from WavCochCausalV8192.

Downloads last month: 28