Spaces:
Sleeping
Sleeping
File size: 2,456 Bytes
c60174c 2f93b69 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 |
# π£οΈ Accent Identifier
This tool identifies the **speaker's accent** from a video or audio input. It supports uploads and URLs β including **direct `.mp4` links**, **Loom videos**, and **YouTube-style links** β and uses a deep learning model from [SpeechBrain](https://speechbrain.readthedocs.io/en/latest/index.html) for inference.
## π Demo
Try it out live on [Hugging Face Spaces](https://pheire-accent-detector.hf.space) *(replace with your actual link)*.
---
## π¦ Features
* π₯ Accepts video/audio uploads (`.mp4`, `.wav`, `.mp3`)
* π Handles direct URLs (e.g. Loom, direct `.mp4`, YouTube)
* π§ Classifies accent using `speechbrain` pretrained model
* π Returns top prediction and top-3 probabilities
* β‘ Fast and easy UI built with [Gradio](https://gradio.app)
---
## π§ͺ Example Inputs
* `https://www.loom.com/share/abc123`
* `https://yourdomain.com/sample.mp4`
* Uploaded audio: `voice_sample.wav`
---
## π οΈ Installation
```bash
git clone https://github.com/yourusername/accent-identifier.git
cd accent-identifier
# Create virtual environment (optional but recommended)
python -m venv venv
source venv/bin/activate # On Windows use `venv\Scripts\activate`
# Install dependencies
pip install -r requirements.txt
```
### requirements.txt
```
speechbrain
gradio
torchaudio
torch
ffmpeg-python
yt-dlp
requests
```
Make sure `ffmpeg` is installed and available in your system path.
You can test with: `ffmpeg -version`
---
## βΆοΈ Run Locally
```bash
python app.py
```
This will launch a Gradio interface in your browser at `http://localhost:7860`.
---
## π§ Model Details
* **Model**: `Jzuluaga/accent-id-commonaccent_ecapa`
* **Framework**: [SpeechBrain](https://speechbrain.readthedocs.io/)
* **Classes**: US, UK, Australia, Canada, India, etc.
---
## π Project Structure
```
accent-identifier/
βββ app.py # Main Gradio app
βββ requirements.txt # Dependencies
βββ README.md # You are here
```
---
## π§© Notes
* Loom support relies on their internal API. It may break if they change the endpoint.
* Audio is extracted to `.wav` using `ffmpeg` with 16kHz mono format for model compatibility.
---
title: Accent Detector
emoji: π’
colorFrom: blue
colorTo: blue
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|