Convert audio to text using automatic speech recognition
Generate Vietnamese speech from text and a sample audio clip