👉 fffiloni/image-2-music-v3 | Feel free to test it and share feedback.
Just wiring together: merve/moondream3 + victor/ace-step-jam
Image → prompt → audio | Early version, will evolve | Follow: @fffiloni
Deeply interrogate audio file content
Long video understanding with smart attention
Extraction & Reconstruction for Efficient Speech Separation
Detect and split video shots into clips with thumbnails
Generate high‑quality images from text prompts
Apache Licensed Advanced Video Generation Model
Animation Sketches sequence Colorization
Aesthetically Controllable Text-Driven Stylization without Training
Every image has a soundtrack
Get a music sample inspired by the mood of an image
Speech generation from text and acoustic reference
Easily expand image boundaries