Baidu has a new neural-network-powered system that is amazingly good at cloning voices.
Mic check: To re-create a voice, AI typically needs to listen to hours of recordings of someone talking. But as New Scientist reports, a new process could get that down to one minute. Baidu researchers have unveiled an upgraded version of Deep Voice, their text-to-speech synthesis system, that can now, once trained, clone any voice after listening to a few snippets of audio.
Details: The more samples Deep Voice hears, the better the results, but just 10 samples of less than five seconds each were enough for it to produce a synthetic voice that could fool a voice-recognition system more than 95 percent of the time. Baidu has posted some of the voice-cloning samples online for anyone to listen to.
Of course there’s a downside: Technology like this could seriously undermine biometric security that uses someone’s voice as a security feature. People are already falling for e-mails “from” their friends—so what happens when it sounds like your mom calling and asking to borrow some money?