Speaker diarization, speake segmentation,
Generate subtitles (SRT) from video or audio files
NVIDIA Parakeet speech recognition for the browser
Generate captions from audio
SAM 3 is a foundation model for promptable segmentation
Gradio demo for MatAnyone 1 & 2
text to video, image to video, video extend