YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Common Accent ASR Model

This is a fine-tuned ASR model based on espnet/owsm_v3.1_ebf_base trained on the DTU54DL/common-accent dataset.

Model details

  • Base model: espnet/owsm_v3.1_ebf_base
  • Language: English
  • Task: Automatic Speech Recognition

Usage

import torch
import numpy as np
from espnet2.bin.s2t_inference import Speech2Text

# Load the model
model = Speech2Text.from_pretrained(
    "reecursion/accent-adaptive-owsm_v3.1_ebf_base",
    lang_sym="<eng>",
    beam_size=1,
    device="cuda" if torch.cuda.is_available() else "cpu"
)

# Example inference
waveform = ... # Load your audio as numpy array
transcription = model(waveform)
print(transcription[0][0]) # Print the transcription
Downloads last month
4
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support