This robot can probably beat you at Jenga—thanks to its understanding of the world

Industrial machines could be trained to be less clumsy if we gave them a sense of touch and a better sense of real-world physics.

Will Knightarchive page

January 30, 2019

SCIENCE ROBOTICS / COVER IMAGE: JOHN HOPKINS UNIVERSITY, WILL KIRK / HOMEWOOD PHOTOGRAPHYScience Robotics / Cover image: John Hopkins University, WILL KIRK / HOMEWOOD PHOTOGRAPHY

Despite dazzling advances in AI, robots are still horribly ham-fisted.

Increasingly, researchers and companies are turning to machine learning to make them more adaptive and dexterous. This typically means feeding the robot a video of what’s in front of it and asking it to work out how it should move in order to manipulate that object. For instance, researchers at OpenAI, a nonprofit in San Francisco, taught a robotic hand to manipulate a child’s block in this way.

But humans, of course, use more than just their eyes to learn how to handle objects. Vision is combined with a sense of touch—and we learn, early on, that objects positioned unstably will probably fall over.

That is what inspired a new robot, developed by Nima Fazeli and his colleagues at MIT, that has been given a fundamental understanding of the real world’s physics—and a usable sense of touch.

It proved how nimble-fingered it is by mastering Jenga, a game that involves removing blocks from a precariously assembled tower, ideally without causing it to topple over. The robot also displayed a kind of ingenuity that is crucial for human players: judging which block it can remove without making the tower fall down.

The research draws from several key ideas developed by Josh Tenenbaum, in the Department of Brain and Cognitive Sciences at MIT, and his research on human cognition. This includes the idea that humans develop an intuitive understanding of physics from an early age, and that probability is key to reasoning about the world. This differs from a lot of AI research today, which revolves around feeding as much data as possible to very large, or “deep,” neural networks.

The robot, equipped with force sensors as well as cameras, learns to play Jenga by poking and prodding blocks and using visual and tactile feedback to train a physics model of the world.

Then, when faced with a new tower of blocks, it used the model to infer, probabilistically, which block it should try to poke out of the tower next. You can see how good it was in the video above.

Animated gif of robot arm playing Jenga — Science Robotics

By combining vision, touch, and this model of real-world physics, the robot can learn to play Jenga more efficiently than would be possible otherwise. The intuitive physics model also lets the robot understand quickly that a block hanging over an edge will most probably fall. In testing, the approach outperformed conventional machine-learning methods. The research is published today in the journal Science Robotics.

This more humanlike learning technique could help make factory and warehouse robots far more capable. If that fails, they could at least challenge you to a fun party game.

Keep Reading

Large language models can do jaw-dropping things. But nobody knows exactly why.

And that's a problem. Figuring it out is one of the biggest scientific puzzles of our time and a crucial step towards controlling more powerful future models.

Will Douglas Heavenarchive page

How scientists traced a mysterious covid case back to six toilets

When wastewater surveillance turns into a hunt for a single infected individual, the ethics get tricky.

Cassandra Willyardarchive page

The problem with plug-in hybrids? Their drivers.

Plug-in hybrids are often sold as a transition to EVs, but new data from Europe shows we’re still underestimating the emissions they produce.

Casey Crownhartarchive page

Google DeepMind’s new generative model makes Super Mario–like games from scratch

Genie learns how to control games by watching hours and hours of video. It could help train next-gen robots too.

Will Douglas Heavenarchive page

Stay connected

Illustration by Rose Wong

Get the latest updates from
MIT Technology Review

Discover special offers, top stories, upcoming events, and more.

This robot can probably beat you at Jenga—thanks to its understanding of the world

Keep Reading

Most Popular

Large language models can do jaw-dropping things. But nobody knows exactly why.

How scientists traced a mysterious covid case back to six toilets

The problem with plug-in hybrids? Their drivers.

Google DeepMind’s new generative model makes Super Mario–like games from scratch

Stay connected

Get the latest updates from
MIT Technology Review

The latest iteration of a legacy

Advertise with MIT Technology Review

About

Help

Keep Reading

Most Popular

Large language models can do jaw-dropping things. But nobody knows exactly why.

How scientists traced a mysterious covid case back to six toilets

The problem with plug-in hybrids? Their drivers.

Google DeepMind’s new generative model makes Super Mario–like games from scratch

Stay connected

Get the latest updates fromMIT Technology Review

Get the latest updates from
MIT Technology Review