programindz's picture
Update README.md
3517f34 verified
---
title: Kashmiri Streaming Asr Zipformer
emoji: πŸ’»
colorFrom: purple
colorTo: gray
sdk: docker
pinned: false
short_description: 'Kashmiri streaming ASR with Zipformer '
---
# πŸŽ™οΈ Real-Time Kashmiri Streaming ASR (FastAPI + Sherpa-ONNX)
This project demonstrates a real-time speech-to-text (ASR) web application with:
* πŸŽ›οΈ Hugging Face Deployment taken from [Luigi](https://huggingface.co/spaces/Luigi/Streaming-Zipformer)
* 🧠 [Sherpa-ONNX](https://github.com/k2-fsa/sherpa-onnx) streaming Zipformer model
* πŸš€ FastAPI backend with WebSocket support
* ☁️ Docker-compatible deployment (CPU-only) on Hugging Face Spaces
## πŸ€– Training
* Model: [Zipformer Small](https://github.com/k2-fsa/icefall)
* Dataset: [IndicVoices](https://huggingface.co/datasets/ai4bharat/IndicVoices)
* WER: 36%
## πŸ§ͺ Local Development
1. **Install dependencies**
```bash
pip install -r requirements.txt
```
2. **Run the app locally**
```bash
uvicorn app.main:app --reload --host 0.0.0.0 --port 8501
```
Open [http://localhost:8501](http://localhost:8501) in your browser.
[https://k2-fsa.github.io/sherpa/ncnn/endpoint.html](https://k2-fsa.github.io/sherpa/ncnn/endpoint.html)
## πŸ“ Project Structure
```
.
β”œβ”€β”€ app
β”‚ β”œβ”€β”€ main.py
β”‚ β”œβ”€β”€ asr.py
β”‚ └── model parts
└── All Model parts here (encoder, decoder, joiner, tokens)
β”œβ”€β”€ index.html
β”œβ”€β”€ requirements.txt
β”œβ”€β”€ Dockerfile
└── README.md
```
## πŸ”§ Credits
* [Sherpa-ONNX](https://github.com/k2-fsa/sherpa-onnx)
* [OpenCC](https://github.com/BYVoid/OpenCC)
* [FastAPI](https://fastapi.tiangolo.com/)
* [Hugging Face Spaces](https://huggingface.co/docs/hub/spaces)
* [AI4Bharat](https://ai4bharat.iitm.ac.in/)
* [Icefall-K2](https://github.com/k2-fsa/icefall)
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference