When the Google self-driving-car project began about a decade ago, the company made a strategic decision to build its technology on expensive lidar and detailed mapping. Google’s self-driving technology still relies on those two pillars today. That approach works up to a point (we have good algorithms for using lidar and camera data to localize a car on the map), but it is not good enough. Driving on complicated, ever-changing streets involves perception and decision-making skills that are inherently uncertain (see “Your Driverless Ride Is Arriving”).
Now an artificial-intelligence technology called deep learning is being used to address the problem. Rather than using the old method of hand-coded algorithms, we can now use systems that program themselves by learning from examples of how a system ought to behave in response to an input. Deep learning is now the best approach to most perception tasks, as well as to many low-level control tasks.
A self-driving car needs a perception system to sense things that are moving (cars, people) as well as things that aren’t (lampposts, curbs). Self-driving vehicles detect dynamic objects using sensors such as cameras, laser scanners, and radar. Of these three, cameras are the cheapest, but they’re also used the least because it’s hard to translate images into detected objects. Using deep learning, we’re seeing dramatic improvements in the car’s ability to understand and make use of such images.
We’re also seeing significant gains from something called “multitask deep learning,” in which a system trained simultaneously to detect lane markings, cars, and pedestrians does better than three separate systems trained in isolation—since the single network can share information among the separate tasks.
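To make the idea of a shared network concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not a description of any production system: the class name `MultiTaskNet`, the layer sizes, and the three task names are hypothetical, the weights are random, and only the forward pass is shown (no training). The point is structural: one shared "trunk" computes features once, and each task gets its own small "head" that reads those same features.

```python
import math
import random


def sigmoid(x):
    """Squash a real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


class MultiTaskNet:
    """Hypothetical toy network: one shared trunk, one output head per task."""

    def __init__(self, n_inputs, n_hidden, tasks, seed=0):
        rng = random.Random(seed)
        # Shared trunk: a single hidden layer whose weights serve every task.
        self.trunk_w = [[rng.uniform(-1, 1) for _ in range(n_inputs)]
                        for _ in range(n_hidden)]
        self.trunk_b = [0.0] * n_hidden
        # Task heads: each task has its own weights over the shared features.
        self.heads = {
            task: ([rng.uniform(-1, 1) for _ in range(n_hidden)], 0.0)
            for task in tasks
        }

    def shared_features(self, x):
        """Compute the hidden features once; all heads reuse them."""
        feats = []
        for row, b in zip(self.trunk_w, self.trunk_b):
            pre = sum(w * xi for w, xi in zip(row, x)) + b
            feats.append(max(0.0, pre))  # ReLU activation
        return feats

    def forward(self, x):
        """Return one score in (0, 1) per task from the same shared features."""
        feats = self.shared_features(x)
        return {
            task: sigmoid(sum(w * f for w, f in zip(ws, feats)) + b)
            for task, (ws, b) in self.heads.items()
        }


# Assumed task names, taken from the examples in the text.
net = MultiTaskNet(n_inputs=4, n_hidden=8,
                   tasks=["lane_marking", "car", "pedestrian"])
scores = net.forward([0.1, 0.2, 0.3, 0.4])
```

In a trained system, the errors from all three heads would flow back into the same trunk weights, which is how information learned for one task can help the others.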
Instead of relying entirely on a pre-computed map, the car can use the map as one of many data streams, combining it with sensor inputs to help it make decisions. (A neural network that knows from map data where crosswalks are, for example, can more accurately detect pedestrians trying to cross than one that relies solely on images.)
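A rough way to see why map context helps is to treat it as a prior. The sketch below is a deliberately simplified stand-in, not a neural network: it combines a camera detector's score with a map-derived prior using Bayes' rule in odds form. The function name `fuse_with_map_prior` and the prior values are assumptions chosen for illustration, not measured figures, and the detector score is assumed to have been computed under a neutral 50/50 prior.

```python
def fuse_with_map_prior(detector_score, near_crosswalk,
                        prior_near=0.30, prior_far=0.05):
    """Combine an image-only pedestrian score with a map-based prior.

    detector_score: image-only probability in (0, 1), assumed to be
        computed under a neutral 0.5 prior.
    near_crosswalk: True if the map places a crosswalk at this location.
    prior_near, prior_far: illustrative (made-up) prior probabilities of a
        pedestrian being present near and away from a crosswalk.
    """
    if not 0.0 < detector_score < 1.0:
        raise ValueError("detector_score must be strictly between 0 and 1")
    prior = prior_near if near_crosswalk else prior_far
    # Under a neutral prior, the score's odds equal the likelihood ratio.
    likelihood_ratio = detector_score / (1.0 - detector_score)
    posterior_odds = likelihood_ratio * prior / (1.0 - prior)
    return posterior_odds / (1.0 + posterior_odds)


# The same ambiguous camera evidence, with and without a mapped crosswalk.
at_crosswalk = fuse_with_map_prior(0.6, near_crosswalk=True)
mid_block = fuse_with_map_prior(0.6, near_crosswalk=False)
```

With identical image evidence, the estimate is higher at a mapped crosswalk than in the middle of a block. A learned system would work out how much weight to give the map from data rather than from hand-set priors like these.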
Deep learning can also alleviate one of the biggest issues identified by many who have ridden in a self-driving car: a “jerky” feel to the driving style, which sometimes leads to motion sickness. A car trained using examples of humans driving can offer a ride that feels more natural.
It’s still early. But just as deep learning did with image search and voice recognition, it is likely to forever change the course of self-driving cars.
Carol Reiley is the cofounder of Drive.ai.