The news: In a fresh spin on manufactured pop, OpenAI has released a neural network called Jukebox that can generate catchy songs in a variety of different styles, from teenybop and country to hip-hop and heavy metal. It even sings—sort of.
How it works: Give it a genre, an artist, and lyrics, and Jukebox will produce a passable pastiche in the style of well-known performers, such as Katy Perry, Elvis Presley or Nas. You can also give it the first few seconds of a song and it will autocomplete the rest.
Old songs, new tricks: Computer-generated music has been a thing for 50 years or more, and AIs already have impressive examples of orchestral classical and ambient electronic compositions in their back catalogue. Video games often use computer-generated music in the background, which loops and crescendos on the fly depending on what the player is doing at the time. But it is much easier for a machine to generate something that sounds a bit like Bach than the Beatles. That’s because the mathematical underpinning of much classical music lends itself to the symbolic representation of music that AI composers often use. Despite being simpler, pop songs are different.
OpenAI trained Jukebox on 1.2 million songs, using the raw audio data itself rather than an abstract representation of pitch, instrument, or timing. But this required a neural network that could track so-called dependencies—a repeating melody, say—across the three or four minutes of a typical pop song, which is hard for an AI to do. To give a sense of the task, Jukebox keeps track of millions of time stamps per song, compared with the thousand time stamps that OpenAI’s language generator GPT-2 uses when keeping track of a piece of writing.
Chatbot sing-alongs: To be honest, it’s not quite there yet. You will notice that the results, while technically impressive, are pretty deep in the uncanny valley. But while we are still a long way from artificial general intelligence (OpenAI’s stated goal), Jukebox shows once again just how good neural networks are getting at imitating humans, blurring the line between what’s real and what’s not. This week, rapper Jay-Z started legal action to remove deepfakes of him singing Billy Joel songs, for example. OpenAI says it plans to conduct research into the implications of AI for intellectual -property rights.
A Roomba recorded a woman on the toilet. How did screenshots end up on Facebook?
Robot vacuum companies say your images are safe, but a sprawling global supply chain for data from our devices creates risk.
The viral AI avatar app Lensa undressed me—without my consent
My avatars were cartoonishly pornified, while my male colleagues got to be astronauts, explorers, and inventors.
Roomba testers feel misled after intimate images ended up on Facebook
An MIT Technology Review investigation recently revealed how images of a minor and a tester on the toilet ended up on social media. iRobot said it had consent to collect this kind of data from inside homes—but participants say otherwise.
How to spot AI-generated text
The internet is increasingly awash with text written by AI software. We need new tools to detect it.
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.