Spaces:
Running
NVIDIA RAG Example
This is an example that shows how to use ACETransport to communicate with a NVIDIA RAG server. It supports Nvidia Riva ASR and TTS.
Get Started
From the example directory, run the following commands to create a virtual environment and install the dependencies:
uv venv
uv sync
source .venv/bin/activate
Update the secrets in the .env file.
cp env.example .env # and add your credentials
Deploy NVIDIA RAG server
Follow instructions here https://gitlab-master.nvidia.com/chat-labs/OpenSource/ai-chatbot
Deploy local Riva ASR and TTS models.
Prerequisites
You have access and are logged into NVIDIA NGC. For step-by-step instructions, refer to the NGC Getting Started Guide.
You have access to an NVIDIA Volta™, NVIDIA Turing™, or an NVIDIA Ampere architecture-based A100 GPU. For more information, refer to the Support Matrix.
You have Docker installed with support for NVIDIA GPUs. For more information, refer to the Support Matrix.
Download Riva Quick Start
Go to the Riva Quick Start for Data center. Select the File Browser tab to download the scripts or use the NGC CLI tool to download from the command line.
ngc registry resource download-version nvidia/riva/riva_quickstart:2.19.0
Deploy Riva Speech Server
From the example directory, run below commands:
cd riva_quickstart_v2.19.0
chmod +x riva_init.sh riva_clean.sh riva_start.sh
bash riva_clean.sh ../../utils/riva_config.sh
bash riva_init.sh ../../utils/riva_config.sh
bash riva_start.sh ../../utils/riva_config.sh
cd ..
This may take few minutes for the first time and will start the riva server on localhost:50051.
For more info, you can refer to the Riva Quick Start Guide.
Run the bot pipeline
python examples/nvidia_rag/bot.py
This will host the static web client along with the ACE controller server, visit http://WORKSTATION_IP:8100/static/index.html in your browser to start a session.
Note: For mic access, you will need to update chrome://flags/ and add http://WORKSTATION_IP:8100 in Insecure origins treated as secure section.