Skip to Content
Artificial intelligence

This robot can probably beat you at Jenga—thanks to its understanding of the world

Industrial machines could be trained to be less clumsy if we gave them a sense of touch and a better sense of real-world physics.
January 30, 2019
SCIENCE ROBOTICS / COVER IMAGE: JOHN HOPKINS UNIVERSITY, WILL KIRK / HOMEWOOD PHOTOGRAPHYScience Robotics / Cover image: John Hopkins University, WILL KIRK / HOMEWOOD PHOTOGRAPHY

Despite dazzling advances in AI, robots are still horribly ham-fisted.

Increasingly, researchers and companies are turning to machine learning to make them more adaptive and dexterous. This typically means feeding the robot a video of what’s in front of it and asking it to work out how it should move in order to manipulate that object. For instance, researchers at OpenAI, a nonprofit in San Francisco, taught a robotic hand to manipulate a child’s block in this way.

But humans, of course, use more than just their eyes to learn how to handle objects. Vision is combined with a sense of touch—and we learn, early on, that objects positioned unstably will probably fall over.

That is what inspired a new robot, developed by Nima Fazeli and his colleagues at MIT, that has been given a fundamental understanding of the real world’s physics—and a usable sense of touch.

It proved how nimble-fingered it is by mastering Jenga, a game that involves removing blocks from a precariously assembled tower, ideally without causing it to topple over. The robot also displayed a kind of ingenuity that is crucial for human players: judging which block it can remove without making the tower fall down.

The research draws from several key ideas developed by Josh Tenenbaum, in the Department of Brain and Cognitive Sciences at MIT, and his research on human cognition. This includes the idea that humans develop an intuitive understanding of physics from an early age, and that probability is key to reasoning about the world. This differs from a lot of AI research today, which revolves around feeding as much data as possible to very large, or “deep,” neural networks.

The robot, equipped with force sensors as well as cameras, learns to play Jenga by poking and prodding blocks and using visual and tactile feedback to train a physics model of the world.

Then, when faced with a new tower of blocks, it used the model to infer, probabilistically, which block it should try to poke out of the tower next. You can see how good it was in the video above.

Animated gif of robot arm playing Jenga
Science Robotics

By combining vision, touch, and this model of real-world physics, the robot can learn to play Jenga more efficiently than would be possible otherwise. The intuitive physics model also lets the robot understand quickly that a block hanging over an edge will most probably fall. In testing, the approach outperformed conventional machine-learning methods. The research is published today in the journal Science Robotics.

This more humanlike learning technique could help make factory and warehouse robots far more capable. If that fails, they could at least challenge you to a fun party game. 

Deep Dive

Artificial intelligence

chasm concept
chasm concept

Artificial intelligence is creating a new colonial world order

An MIT Technology Review series investigates how AI is enriching a powerful few by dispossessing communities that have been dispossessed before.

open sourcing language models concept
open sourcing language models concept

Meta has built a massive new language AI—and it’s giving it away for free

Facebook’s parent company is inviting researchers to pore over and pick apart the flaws in its version of GPT-3

spaceman on a horse generated by DALL-E
spaceman on a horse generated by DALL-E

This horse-riding astronaut is a milestone in AI’s journey to make sense of the world

OpenAI’s latest picture-making AI is amazing—but raises questions about what we mean by intelligence.

labor exploitation concept
labor exploitation concept

How the AI industry profits from catastrophe

As the demand for data labeling exploded, an economic catastrophe turned Venezuela into ground zero for a new model of labor exploitation.

Stay connected

Illustration by Rose WongIllustration by Rose Wong

Get the latest updates from
MIT Technology Review

Discover special offers, top stories, upcoming events, and more.

Thank you for submitting your email!

Explore more newsletters

It looks like something went wrong.

We’re having trouble saving your preferences. Try refreshing this page and updating them one more time. If you continue to get this message, reach out to us at customer-service@technologyreview.com with a list of newsletters you’d like to receive.