Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Paper • 2603.25750 • Published 10 days ago • 7
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Paper • 2603.25750 • Published 10 days ago • 7
SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection Paper • 2603.20686 • Published 9 days ago • 3
MathBridge: A Large-Scale Dataset for Translating Mathematical Expressions into Formula Images Paper • 2408.07081 • Published Aug 7, 2024 • 2
SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection Paper • 2603.20686 • Published 9 days ago • 3