Spaces:

EdgarDataScientist
/

REM_WASTE_INTERVIEW

Running

App Files Files Community

EdgarDataScientist commited on 14 days ago

Commit

d074961

verified ·

1 Parent(s): 8cdbd03

Update README.md

Browse files

Files changed (1) hide show

README.md +112 -1

README.md CHANGED Viewed

@@ -1,10 +1,121 @@
 ---
 title: REM_WASTE_INTERVIEW
 emoji: 🎤
 colorFrom: indigo
 colorTo: red
 sdk: streamlit
-sdk_version: 1.31.1
 app_file: app.py
 pinned: false
 ---

+# English Accent Detection Tool
+## Project Overview
+This tool is a working proof-of-concept designed to evaluate spoken English in candidate video submissions. It automatically extracts audio from a public video or uploaded file, identifies whether the language spoken is English, and classifies the English accent (e.g., American, British, Australian). A confidence score is also provided to aid in candidate screening.
+This submission was developed as part of the REM Waste hiring challenge, with emphasis on practicality, technical clarity, and clean design.
+---
+## Features
+* Accepts public video URLs (e.g., Loom, MP4 links) or uploaded video/audio files.
+* Extracts audio using `ffmpeg`.
+* Detects the spoken language using `SpeechBrain`'s language identification model.
+* If English is detected, simulates classification into common English accents.
+* Outputs include:
+  * Accent classification
+  * Confidence score (0–100%)
+  * Brief summary
+---
+## Live Demo
+Deployed Streamlit app (hosted on Streamlit Cloud):
+**\[Live App URL – Insert Link Here]**
+---
+## Technology Stack
+* **Python 3**
+* **Streamlit** for the web interface
+* **SpeechBrain** for spoken language identification
+* **Torchaudio** for audio preprocessing
+* **FFMPEG** for audio extraction
+* **Requests, Matplotlib** for I/O and optional output handling
+---
+## How It Works
+1. The user inputs a video URL or uploads a file.
+2. The audio is extracted and resampled to a suitable format.
+3. The system determines whether the speaker is using English.
+4. If English is detected, the tool classifies the accent based on common linguistic traits.
+5. The result includes:
+   * Accent label (e.g., British)
+   * Confidence score
+   * Explanation or notes
+---
+## Local Setup Instructions
+1. Clone the repository:
+   ```bash
+   git clone https://github.com/yourusername/english-accent-detector.git
+   cd english-accent-detector
+   ```
+2. Install dependencies:
+   ```bash
+   pip install -r requirements.txt
+   ```
+3. Launch the app:
+   ```bash
+   streamlit run app.py
+   ```
+---
+## Requirements
+```
+streamlit
+torch
+torchaudio
+speechbrain
+ffmpeg-python
+requests
+matplotlib
+```
+---
+## Notes
+* Accent classification is simulated based on common accent features, due to the lack of an open-source, fine-grained English accent classifier.
+* The core English language detection is handled by a pre-trained SpeechBrain model.
+* This project was developed as a rapid prototype within the recommended 4–6 hour window and can be expanded into a production-grade system with access to more detailed accent datasets and APIs.
+---
+## Author
+Developed by Edgar Muyale
+For inquiries: edgarmuyale@gmail.com
+Submission for REM Waste Hiring Challenge
 ---
 title: REM_WASTE_INTERVIEW
 emoji: 🎤
 colorFrom: indigo
 colorTo: red
 sdk: streamlit
+sdk_version: 1.45.1
 app_file: app.py
 pinned: false
 ---