ML-SUPERB: Multilingual Speech Universal PERformance Benchmark Paper • 2305.10615 • Published May 18, 2023
Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning Paper • 2309.15317 • Published Sep 26, 2023
Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data Paper • 2309.13876 • Published Sep 25, 2023
Improving Massively Multilingual ASR With Auxiliary CTC Objectives Paper • 2302.12829 • Published Feb 24, 2023
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Paper • 2401.16658 • Published Jan 30, 2024
YODAS: Youtube-Oriented Dataset for Audio and Speech Paper • 2406.00899 • Published Jun 2, 2024