Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens • Paper 2602.16687 • Published 5 days ago
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis • Paper 2505.02625 • Published May 5, 2025