
This robot can probably beat you at Jenga—thanks to its understanding of the world

Industrial machines could be trained to be less clumsy if we gave them a sense of touch and a better sense of real-world physics.
January 30, 2019

Despite dazzling advances in AI, robots are still horribly ham-fisted.

Increasingly, researchers and companies are turning to machine learning to make robots more adaptive and dexterous. This typically means feeding the robot video of an object in front of it and asking it to work out how it should move in order to manipulate that object. For instance, researchers at OpenAI, a nonprofit in San Francisco, taught a robotic hand to manipulate a child’s block in this way.

But humans, of course, use more than just their eyes to learn how to handle objects. Vision is combined with a sense of touch—and we learn, early on, that objects positioned unstably will probably fall over.

That is what inspired a new robot, developed by Nima Fazeli and his colleagues at MIT, that has been given a fundamental understanding of the real world’s physics—and a usable sense of touch.

It proved how nimble-fingered it is by mastering Jenga, a game that involves removing blocks from a precariously assembled tower, ideally without causing it to topple over. The robot also displayed a kind of ingenuity that is crucial for human players: judging which block it can remove without making the tower fall down.

The research draws on several key ideas developed by Josh Tenenbaum, of the Department of Brain and Cognitive Sciences at MIT, in his research on human cognition. These include the idea that humans develop an intuitive understanding of physics from an early age, and that probability is key to reasoning about the world. This differs from a lot of AI research today, which revolves around feeding as much data as possible to very large, or “deep,” neural networks.

The robot, equipped with force sensors as well as cameras, learns to play Jenga by poking and prodding blocks and using visual and tactile feedback to train a physics model of the world.

Then, when faced with a new tower of blocks, it uses the model to infer, probabilistically, which block it should try to poke out of the tower next. You can see how good it is in the accompanying animation.
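To give a rough feel for what inferring "probabilistically" can mean here, the toy sketch below applies Bayes' rule to a single force reading from a gentle probe and then picks the block most likely to be loose. It is not the MIT team's model: the force statistics, the prior, and the names `FORCE_MODEL`, `prob_loose`, and `pick_block` are all made up for illustration.

```python
import math

# Hypothetical force statistics in newtons for each block state: (mean, std dev).
# These numbers are invented for illustration, not taken from the research.
FORCE_MODEL = {"loose": (0.5, 0.3), "stuck": (2.0, 0.6)}
PRIOR_LOOSE = 0.5  # assumed prior belief that any given block is loose


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def prob_loose(force, prior=PRIOR_LOOSE):
    """Bayes' rule: P(block is loose | measured resistance force)."""
    like_loose = gaussian_pdf(force, *FORCE_MODEL["loose"])
    like_stuck = gaussian_pdf(force, *FORCE_MODEL["stuck"])
    evidence = like_loose * prior + like_stuck * (1.0 - prior)
    return like_loose * prior / evidence


def pick_block(probe_forces):
    """Return the block id whose probe reading most suggests it is loose."""
    return max(probe_forces, key=lambda block: prob_loose(probe_forces[block]))


if __name__ == "__main__":
    # Made-up resistance readings from gently pushing three blocks.
    readings = {"A": 1.9, "B": 0.4, "C": 1.1}
    for block, force in readings.items():
        print(block, round(prob_loose(force), 3))
    print("try next:", pick_block(readings))
```

In this toy version a block that barely resists the probe gets a high probability of being loose, so it is the one chosen. The real system, as described above, learns its physics model from both visual and tactile feedback rather than from hand-picked numbers.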

Animation: robot arm playing Jenga. Credit: Science Robotics

By combining vision, touch, and this model of real-world physics, the robot can learn to play Jenga more efficiently than would be possible otherwise. The intuitive physics model also lets the robot understand quickly that a block hanging over an edge will most probably fall. In testing, the approach outperformed conventional machine-learning methods. The research is published today in the journal Science Robotics.

This more humanlike learning technique could help make factory and warehouse robots far more capable. If that fails, they could at least challenge you to a fun party game. 
