With the launch of Riffusion, I had to test out some of the prompts in my post about prompt-based music generation: https://thisisimportant.net/posts/prompt-based-music-generation/
Will we see prompt-based music generation?

An examination of the current state, challenges, and resources needed to create a prompt-based music generation tool like DALL-E 2, but for music.

First up, the UK garage track with trumpets. It's pretty clear that the model doesn't really get "uk garage" or "trumpets" as prompts.
https://soundcloud.com/denyinghipster/riffusion-prompt-uk-garage
Riffusion Prompt: UK Garage Track With Trumpets

Testing out Riffusion with the prompts from my blog post about prompt-based music generation: https://thisisimportant.net/posts/prompt-based-music-generation/

SoundCloud
Next I tried a more descriptive yet lossy prompt, "bright vocals sung by a feminine voice and a guitar solo in the middle":
https://soundcloud.com/denyinghipster/riffusion-prompt-bright-vocals
Riffusion Prompt: Bright Vocals Sung By A Feminine Voice And A Guitar Solo In The Middle

Testing out Riffusion with the prompts from my blog post about prompt-based music generation: https://thisisimportant.net/posts/prompt-based-music-generation/

SoundCloud
The word "bright" worked okay, but I'm not sure I'd describe the sounds I hear as a guitar. Unsurprisingly, vocals aren't a thing the model can tackle.

Then I tried the most detailed prompt I considered in my post:
Melancholy breakup song with acoustic guitars in the style of Taylor Swift’s folklore album with lyrics about a queer couple

https://soundcloud.com/denyinghipster/prompt-melancholy-breakup-song

My guess is that the taylor swift comparison really helped out the model here, but WOW does it fall apart in terms of rhythm and meter.

Prompt: Melancholy Breakup Song With Acoustic Guitars In The Style Of Taylor Swift’s Folklore Album

Testing out Riffusion with the prompts from my blog post about prompt-based music generation: https://thisisimportant.net/posts/prompt-based-music-generation/

SoundCloud

The next prompt also uses an artist comparison.

"Short vocal track with flute that sounds like Godford"

the model was able to produce something vaguely high pitched and with a higher bpm, but no discernible flute and the backbeat hiccups.

https://soundcloud.com/denyinghipster/riffusion-prompt-short-vocal

Riffusion Prompt: Short Vocal Track With Flute That Sounds Like Godford

Testing out Riffusion with the prompts from my blog post about prompt-based music generation: https://thisisimportant.net/posts/prompt-based-music-generation/

SoundCloud

the most detailed and perhaps highbrow prompt is also pretty nonsensical in terms of what I might expect to hear:

"High instrumentality and energy with bright timbre 120 BPM in the key of C minor"

https://soundcloud.com/denyinghipster/riffusion-prompt-high

turns out this results in something that sounds like an organ just pressing and holding a C-minor chord. At least I'm pretty sure it's a C-minor chord.

https://www.key-notes.com/blog/c-minor-chord

Riffusion Prompt: High Instrumentality And Energy With Bright Timbre 120 BPM In The Key Of C Minor

Testing out Riffusion with the prompts from my blog post about prompt-based music generation: https://thisisimportant.net/posts/prompt-based-music-generation/

SoundCloud

Finally, in honor of the Switched On Pop book I just started reading, I decided to revisit the folk song prompt with some helpful meter guidance:

"melancholy folk song with meter in 3"

https://soundcloud.com/denyinghipster/riffusion-prompt-melancholy

It sounds kind of like an incoherent circus noise that you'd get from playing a music box.

Riffusion Prompt: Melancholy Folk Song With Meter In 3

Testing out Riffusion with the prompts from my blog post about prompt-based music generation: https://thisisimportant.net/posts/prompt-based-music-generation/

SoundCloud

Seems like as predicted, we have a long way to go before we can produce coherent music from prompts. Rules-based systems will probably continue to... rule... for quite some time.

Try out Riffusion for yourself on HuggingFace:
https://huggingface.co/spaces/fffiloni/spectrogram-to-music

And check out my thoughts on my blog:
https://thisisimportant.net/posts/prompt-based-music-generation/

Riffusion • Spectrogram To Music - a Hugging Face Space by fffiloni

Discover amazing ML apps made by the community