Baidu has a new neural-network-powered system that is amazingly good at cloning voices.
Mic check: To re-create a voice, AI typically needs to listen to hours of recordings of someone talking. But as New Scientist reports, a new process could get that down to one minute. Baidu researchers have unveiled an upgraded version of Deep Voice, their text-to speech synthesis system, that can now, once trained, clone any voice after listening to a few snippets of audio.
Details: The more samples Deep Voice hears, the better the results, but just 10 samples of less than five seconds each were enough for it to produce a synthetic voice that could fool a voice-recognition system more than 95 percent of the time. Baidu hosted some of the voice-cloning samples here for anyone to take a listen.
Of course there’s a downside: Technology like this could seriously undermine biometric security that uses someone’s voice as a security feature. People are already falling for e-mails “from” their friends—so what happens when it sounds like your mom calling and asking to borrow some money?
This artist is dominating AI-generated art. And he’s not happy about it.
Greg Rutkowski is a more popular prompt than Picasso.
What does GPT-3 “know” about me?
Large language models are trained on troves of personal data hoovered from the internet. So I wanted to know: What does it have on me?
An AI that can design new proteins could help unlock new cures and materials
The machine-learning tool could help researchers discover entirely new proteins not yet known to science.
DeepMind’s new chatbot uses Google searches plus humans to give better answers
The lab trained a chatbot to learn from human feedback and search the internet for information to support its claims.
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.