Google researchers have made an AI that can generate minutes-long musical pieces from text prompts, and can even transform a whistled or hummed melody into other instruments. The model is called MusicLM and while you can’t play around with it for yourself, the company has uploaded a bunch of samples that it produced using the model.
There are 30-second snippets of what sound like actual songs created from paragraph-long descriptions that prescribe a genre, vibe, and even specific instruments, as well as five-minute-long pieces generated from one or two words like “melodic techno.” Perhaps my favorite is a demo of “story mode,” where the model is basically given a script to morph between prompts.
Read more – https://www.theverge.com/2023/1/28/23574573/google-musiclm-text-to-music-ai
For example, this prompt:
electronic song played in a videogame (0:00-0:15)
meditation song played next to a river (0:15-0:30)
fire (0:30-0:45)
fireworks (0:45-0:60)
Below is the music generated according to this text.