metadata
title: Kashmiri Streaming ASR Zipformer
emoji: 💻
colorFrom: purple
colorTo: gray
sdk: docker
pinned: false
short_description: 'Kashmiri streaming ASR with Zipformer'
Real-Time Kashmiri Streaming ASR (FastAPI + Sherpa-ONNX)
This project demonstrates a real-time speech-to-text (ASR) web application with:
- Hugging Face deployment taken from Luigi
- Sherpa-ONNX streaming Zipformer model
- FastAPI backend with WebSocket support
- Docker-compatible, CPU-only deployment on Hugging Face Spaces
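The WebSocket backend listed above has to turn binary audio frames from the browser into samples the recognizer can consume. Here is a minimal sketch of that step, assuming the client sends 16-bit little-endian mono PCM. The actual wire format of this app is not documented here, and `pcm16_to_float` is a hypothetical helper name.

```python
import struct


def pcm16_to_float(frame: bytes) -> list[float]:
    """Convert 16-bit little-endian PCM bytes to floats in [-1.0, 1.0)."""
    # Drop a trailing odd byte so a partial sample never raises.
    usable = len(frame) - (len(frame) % 2)
    count = usable // 2
    samples = struct.unpack(f"<{count}h", frame[:usable])
    # 32768 is the magnitude of the most negative int16 value.
    return [s / 32768.0 for s in samples]
```

Streaming recognizers such as Sherpa-ONNX generally expect normalized float samples, so a conversion like this would typically run on each received frame before it is fed to the model.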
Training
- Model: Zipformer Small
- Dataset: IndicVoices
- WER: 36%
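For readers unfamiliar with the metric, a WER of 36% means roughly 36 word-level errors (substitutions, deletions, insertions) per 100 reference words. The sketch below shows the standard edit-distance definition. The text normalization and test split behind the reported figure are not stated, so this is an illustration, not the project's scoring script.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, WER can exceed 100% when the hypothesis is much longer than the reference.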
Local Development
- Install dependencies
pip install -r requirements.txt
- Run the app locally
uvicorn app.main:app --reload --host 0.0.0.0 --port 8501
Open http://localhost:8501 in your browser.
For endpoint detection in Sherpa, see https://k2-fsa.github.io/sherpa/ncnn/endpoint.html
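The page linked above covers endpoint detection, which is how a streaming recognizer decides that an utterance has finished. Below is a rough sketch of rule-based endpointing in that spirit. The class name, function name, and threshold values are illustrative assumptions, not values read from this project's configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EndpointRule:
    must_contain_speech: bool    # rule applies only after text was decoded
    min_trailing_silence: float  # seconds of silence required at the end
    min_utterance_length: float  # seconds of total audio required


# Illustrative thresholds (assumptions, not this project's settings).
RULES = (
    EndpointRule(False, 2.4, 0.0),   # long silence, nothing decoded yet
    EndpointRule(True, 1.2, 0.0),    # shorter silence after decoded text
    EndpointRule(False, 0.0, 20.0),  # utterance has simply run too long
)


def is_endpoint(decoded_something: bool, trailing_silence: float,
                utterance_length: float, rules=RULES) -> bool:
    """Return True if any rule fires for the current stream state."""
    for rule in rules:
        if rule.must_contain_speech and not decoded_something:
            continue
        if (trailing_silence >= rule.min_trailing_silence
                and utterance_length >= rule.min_utterance_length):
            return True
    return False
```

When a check like this returns true, a streaming app would typically emit the final transcript for the segment and reset the recognizer stream.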
Project Structure
.
├── app
│   ├── main.py
│   ├── asr.py
│   └── model parts
│       └── All model parts here (encoder, decoder, joiner, tokens)
├── index.html
├── requirements.txt
├── Dockerfile
└── README.md
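Since the app depends on four model parts (encoder, decoder, joiner, tokens), a startup check that fails fast when one is missing can save debugging time inside a container. Here is a small sketch. The file names below are hypothetical placeholders, so substitute whatever names the exported model actually uses.

```python
from pathlib import Path

# Hypothetical file names; adjust to match the exported model files.
REQUIRED_PARTS = ("encoder.onnx", "decoder.onnx", "joiner.onnx", "tokens.txt")


def missing_model_parts(model_dir: str, required=REQUIRED_PARTS) -> list[str]:
    """Return the required file names that are not present in model_dir."""
    root = Path(model_dir)
    return [name for name in required if not (root / name).is_file()]
```

Calling this once at application startup and raising an error if the returned list is non-empty gives a clearer message than a failure deep inside model loading.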
Credits
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference