The US Army is creating robots that can follow orders

For robots to be useful teammates, they need to be able to understand what they’re told to do—and execute it with minimal supervision.

David Hambling

November 6, 2019

A Clearpath robot under remote control. Credit: Clearpath Robotics

Military robots have always been pretty dumb. The PackBot the US Army uses for inspections and bomb disposal, for example, has practically no onboard intelligence and is piloted by remote control. What the Army has long wanted instead are intelligent robot teammates that can follow orders without constant supervision.

That is now a step closer. The Army’s research lab has developed software that lets robots understand verbal instructions, carry out a task, and report back. The potential rewards are tremendous. A robot that can understand commands and has a degree of machine intelligence would one day be able to go ahead of troops and check for IEDs or ambushes. It could also reduce the number of human soldiers needed on the ground.

“Even self-driving cars don’t have a high enough level of understanding to be able to follow instructions from another person and carry out a complex mission,” says Nicholas Roy of MIT, who was part of the team behind the project. “But our robot can do exactly that.”

Roy has been working on the problem as part of the Robotics Collaborative Technology Alliance, a 10-year project led by the Army Research Laboratory (ARL). The project team included researchers from MIT and Carnegie Mellon working alongside government institutions like NASA’s Jet Propulsion Laboratory and robotics firms such as Boston Dynamics. The program finished last month, with a series of events to show off what it had achieved. A number of robots were put through their paces, showing off their manipulation skills, mobility over obstacles, and ability to follow verbal instructions.

The idea is that they are able to work with people more effectively—not unlike a military dog. “The dog is a perfect example of what we’re aiming for in terms of teaming with humans,” says project leader Stuart Young. Like a dog, the robot can take verbal instructions and interpret gestures. But it can also be controlled via a tablet and return data in the form of maps and images so the operator can see exactly what is behind the building, for example.

The team used a hybrid approach to help robots make sense of the world around them. Deep learning is particularly good at image recognition, so algorithms similar to those Google uses to recognize objects in photos let the robots identify buildings, vegetation, vehicles, and people. Senior ARL roboticist Ethan Stump says that as well as identifying whole objects, a robot running the software can recognize key points like the headlights and wheels of a car, helping it work out the car’s exact position and orientation.

Once it has used deep learning to identify an object, the robot uses a knowledge base to pull out more detailed information that helps it carry out its orders. For example, when it identifies an object as a car, it consults a list of facts relating to cars: a car is a vehicle, it has wheels and an engine, and so on. These facts need to be hand-coded and are time-consuming to compile, however, and Stump says the team is looking into ways to streamline this. (Others are looking at similar challenges: DARPA’s “Machine Common Sense” (MCS) program is combining deep learning with a knowledge-base-centered approach so a robot can learn and show something like human judgment.)
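To make the knowledge-base step concrete, here is a minimal sketch in Python of how a label produced by an image recognizer might be looked up in a hand-coded fact table. It is an illustration only, not the ARL software: the table contents, the `is_a` and `has` field names, and the `facts_for` function are all assumptions made for this example.

```python
# Hypothetical hand-coded knowledge base. Every entry here is illustrative
# and is not taken from the Army Research Laboratory's system.
KNOWLEDGE_BASE = {
    "car": {"is_a": "vehicle", "has": ["wheels", "engine"]},
    "vehicle": {"is_a": "object", "has": []},
    "building": {"is_a": "structure", "has": ["walls", "doors"]},
}


def facts_for(label):
    """Collect facts for a recognized label, following is_a links upward."""
    categories = []
    parts = []
    seen = set()
    current = label
    # Walk up the is_a chain until we reach a label with no entry.
    # The seen set guards against accidental cycles in the table.
    while current in KNOWLEDGE_BASE and current not in seen:
        seen.add(current)
        entry = KNOWLEDGE_BASE[current]
        for part in entry["has"]:
            if part not in parts:
                parts.append(part)
        categories.append(entry["is_a"])
        current = entry["is_a"]
    return {"label": label, "categories": categories, "parts": parts}


if __name__ == "__main__":
    # Assume the image recognizer has just reported the label "car".
    print(facts_for("car"))
```

In this sketch a label missing from the table simply returns empty lists, which reflects the limitation Stump describes: the robot knows only what someone has written down for it.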
