--- title: Real-time Speech-to-Text emoji: 🎙️ colorFrom: indigo colorTo: gray sdk: gradio sdk_version: 5.29.0 app_file: app.py pinned: false --- # Real-time Speech-to-Text with NeMo This is a real-time speech-to-text transcription application powered by NVIDIA NeMo and the parakeet-tdt-0.6b-v2 model. ## Features - 🎙️ Web-based microphone input - ⚡ Real-time transcription displayed in the browser - 🧠 Fast inference with NeMo pre-trained model - 🛠️ Easy to use, no installations required ## Tech Stack - Python - Gradio - NVIDIA NeMo Toolkit for ASR ## How to Use 1. Click the microphone button to start recording 2. Speak clearly into your microphone 3. The transcription will appear in real-time 4. Click 'Clear Transcript' to start a new transcription ## Note This application requires access to your microphone to function. The audio is processed in real-time and is not stored.