ASR ggerganov/whisper.cpp Automatic Speech Recognition • Updated Oct 29, 2024 • 1.33k nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0 Automatic Speech Recognition • Updated Oct 21, 2025 • 4.43k • 33 NAMAA-Space/EgypTalk-ASR-v2 Updated Aug 9, 2025 • 244 • 7 Qwen/Qwen3-ASR-0.6B Automatic Speech Recognition • 0.9B • Updated Jan 30 • 401k • 243
nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0 Automatic Speech Recognition • Updated Oct 21, 2025 • 4.43k • 33
TTS coqui/XTTS-v2 Text-to-Speech • Updated Dec 11, 2023 • 6.92M • 3.43k MohamedRashad/Arabic-Whisper-CodeSwitching-Edition Automatic Speech Recognition • 2B • Updated Jul 7, 2024 • 827 • 29 nvidia/personaplex-7b-v1 Audio-to-Audio • Updated 17 days ago • 405k • 2.29k nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 5 days ago • 37.4k • 503
MohamedRashad/Arabic-Whisper-CodeSwitching-Edition Automatic Speech Recognition • 2B • Updated Jul 7, 2024 • 827 • 29
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 5 days ago • 37.4k • 503
ASR ggerganov/whisper.cpp Automatic Speech Recognition • Updated Oct 29, 2024 • 1.33k nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0 Automatic Speech Recognition • Updated Oct 21, 2025 • 4.43k • 33 NAMAA-Space/EgypTalk-ASR-v2 Updated Aug 9, 2025 • 244 • 7 Qwen/Qwen3-ASR-0.6B Automatic Speech Recognition • 0.9B • Updated Jan 30 • 401k • 243
nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0 Automatic Speech Recognition • Updated Oct 21, 2025 • 4.43k • 33
TTS coqui/XTTS-v2 Text-to-Speech • Updated Dec 11, 2023 • 6.92M • 3.43k MohamedRashad/Arabic-Whisper-CodeSwitching-Edition Automatic Speech Recognition • 2B • Updated Jul 7, 2024 • 827 • 29 nvidia/personaplex-7b-v1 Audio-to-Audio • Updated 17 days ago • 405k • 2.29k nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 5 days ago • 37.4k • 503
MohamedRashad/Arabic-Whisper-CodeSwitching-Edition Automatic Speech Recognition • 2B • Updated Jul 7, 2024 • 827 • 29
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 5 days ago • 37.4k • 503
EbrahemHesham/Llama-3.2-11B-Vision-Radiology-mini Image-Text-to-Text • 11B • Updated Nov 16, 2025 • 2