Spatial Audio Learning

university

Higobeatz

authored 3 papers 8 months ago

Noise-robust Speech Separation with Fast Generative Correction

Paper • 2406.07461 • Published Jun 11, 2024

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Paper • 2505.19314 • Published May 25, 2025 • 4

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published Jun 3, 2025 • 8

jaeyeonkim99

authored a paper 10 months ago

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Paper • 2504.00557 • Published Apr 1, 2025 • 15

Higobeatz

authored 4 papers over 1 year ago

DreamVoice: Text-Guided Voice Conversion

Paper • 2406.16314 • Published Jun 24, 2024 • 1

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis

Paper • 2409.07556 • Published Sep 11, 2024 • 2

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer

Paper • 2409.08425 • Published Sep 12, 2024 • 10

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 18

popcornell

authored a paper over 1 year ago

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Paper • 2409.09506 • Published Sep 14, 2024 • 4