Google says its new chatbot Meena is the best in the world

Google has released a neural-network-powered chatbot called Meena that it claims is better than any other chatbot out there.
Data slurp: Meena was trained on a whopping 341 gigabytes of public social-media chatter—8.5 times as much data as OpenAI’s GPT-2. Google says Meena can talk about pretty much anything, and can even make up (bad) jokes.
Why it matters: Open-ended conversation that covers a wide range of topics is hard, and most chatbots can’t keep up. At some point most say things that make no sense or reveal a lack of basic knowledge about the world. A chatbot that avoids such mistakes will go a long way toward making AIs feel more human, and make characters in video games more lifelike.
Sense and specificity: To put Meena to the test, Google has developed a new metric it calls the Sensibleness and Specificity Average (SSA), which captures important attributes for natural conversations, such as whether each utterance makes sense in context—which many chatbots can do—and is specific to what has just been said, which is harder.
What do you mean? For example, if you say “I like tennis” and a chatbot replies “That’s nice,” the response makes sense but is not specific. Many chatbots rely on tricks like this to hide the fact that they don’t know what you’re talking about. On the other hand, a response such as “Me too—I can’t get enough of Roger Federer” is specific. Google used crowdworkers to generate sample conversations and to score utterances in around 100 conversations. Meena got an SSA score of 79%, compared with 56% for Mitsuku, a state-of-the-art chatbot that has won the Loebner Prize for the last four years. Even human conversation partners only scored 86% in this new test.
Can I talk to Meena? Not yet. Google says it won’t be releasing a public demo until it has vetted the model for safety and bias, which is probably a good thing. When Microsoft released its chatbot Tay on Twitter in 2016, it started spewing racist, misogynistic invective within hours.
Deep Dive
Artificial intelligence
A Roomba recorded a woman on the toilet. How did screenshots end up on Facebook?
Robot vacuum companies say your images are safe, but a sprawling global supply chain for data from our devices creates risk.
The viral AI avatar app Lensa undressed me—without my consent
My avatars were cartoonishly pornified, while my male colleagues got to be astronauts, explorers, and inventors.
Roomba testers feel misled after intimate images ended up on Facebook
An MIT Technology Review investigation recently revealed how images of a minor and a tester on the toilet ended up on social media. iRobot said it had consent to collect this kind of data from inside homes—but participants say otherwise.
How to spot AI-generated text
The internet is increasingly awash with text written by AI software. We need new tools to detect it.
Stay connected
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.