GigaSpeech Series Collection Evolving, Large-Scale, and Multi-domain ASR Corpus • 4 items • Updated 4 days ago
k2SSL Collection A Faster and Better Framework for Self-Supervised Speech Representation Learning • 5 items • Updated 4 days ago
SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing Paper • 2601.09385 • Published 10 days ago
CLSP Collection Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training • 4 items • Updated 4 days ago
CLSP Collection Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training • 4 items • Updated 4 days ago
CLSP Collection Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training • 4 items • Updated 4 days ago
Towards Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training Paper • 2601.03065 • Published 18 days ago