	---
	title: Speech Segmenter (STT)
	emoji: 🏃
	colorFrom: gray
	colorTo: blue
	sdk: gradio
	sdk_version: 5.39.0
	app_file: app.py
	pinned: false
	short_description: Advanced audio transcription with alignment & diarization
	---

	This Space provides an advanced Speech-to-Text (STT) pipeline enhanced with alignment and speaker diarization:

	- STT (Speech-to-Text): Converts spoken audio into written text (transcription).
	- Alignment: Matches each transcribed word to its start and end time in the audio (word-level timestamps).
	- Speaker Diarization: Detects and labels who spoke when, so each stretch of speech can be attributed to a speaker.
	- Post-processing: Merges the transcription, word timings, and speaker labels into a structured transcript.
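
As a rough illustration of the post-processing step, the sketch below assigns each timed word to the speaker turn it overlaps most, then merges consecutive words from the same speaker into segments. It is a minimal sketch, not this Space's actual implementation: the `Word` and `Turn` types, the function names, and the `SPEAKER_00`-style labels are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """A transcribed word with word-level timing (hypothetical type)."""
    text: str
    start: float  # seconds
    end: float    # seconds


@dataclass
class Turn:
    """A diarization turn: one speaker talking over a time span (hypothetical type)."""
    speaker: str
    start: float
    end: float


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_words(words, turns, unknown="UNKNOWN"):
    """Pair each word with the speaker whose turn overlaps it the most."""
    labeled = []
    for w in words:
        best = max(
            turns,
            key=lambda t: overlap(w.start, w.end, t.start, t.end),
            default=None,
        )
        if best is None or overlap(w.start, w.end, best.start, best.end) == 0.0:
            speaker = unknown  # word falls outside every diarization turn
        else:
            speaker = best.speaker
        labeled.append((speaker, w))
    return labeled


def merge_segments(labeled):
    """Group consecutive words from the same speaker into transcript segments."""
    segments = []
    for speaker, w in labeled:
        if segments and segments[-1]["speaker"] == speaker:
            segments[-1]["text"] += " " + w.text
            segments[-1]["end"] = w.end
        else:
            segments.append(
                {"speaker": speaker, "start": w.start, "end": w.end, "text": w.text}
            )
    return segments


# Example input, made up for illustration.
words = [Word("hello", 0.0, 0.4), Word("there", 0.5, 0.9), Word("hi", 1.2, 1.5)]
turns = [Turn("SPEAKER_00", 0.0, 1.0), Turn("SPEAKER_01", 1.0, 2.0)]
segments = merge_segments(label_words(words, turns))
```

Here `segments` holds one entry per speaker change, each with a speaker label, start and end times, and the joined text. Maximum overlap is only one possible assignment rule; a real pipeline may also need to handle overlapping speech and words that straddle a turn boundary.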

	Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference