Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering • Paper • 2604.08224 • Published 5 days ago • 44
Do Audio-Visual Large Language Models Really See and Hear? • Paper • 2604.02605 • Published 11 days ago • 7
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models • Paper • 2603.25750 • Published 24 days ago • 36