Analyze a song and answer any music question
Generate text based on audio input and questions
Describe audio with questions