DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving Paper • 2312.09245 • Published Dec 14, 2023
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue Paper • 2510.13747 • Published 5 days ago • 28