nvidia/parakeet-tdt-0.6b-v2 Automatic Speech Recognition β’ 0.6B β’ Updated 28 days ago β’ 886k β’ 1.25k
view reply It's not prompted. The source Audio had that emotional context and the model simply copied it.