Intelligent Machines

Cars May Soon Understand More of What You Say

It should soon be possible to give your car more complicated and natural verbal commands.

In the United States, nearly one in five road accidents involves some form of driver distraction.

Many cars now come with voice control, but you can’t really talk normally to such systems, and you often have to repeat a phrase to get the job done. That could change, however, with the introduction of voice interfaces that allow for a more natural back-and-forth between driver and dashboard.

“What we’re going to see in the very near future is the ability to have a dialogue,” says Charlie Ortiz, who is senior principal manager of the artificial intelligence and reasoning group at Nuance, a voice recognition technology company based in Burlington, Massachusetts. “You might say I want to listen to some Latin jazz, or suggest a particular musician.”

Ortiz says that such technology is now in the vehicle production pipeline, which means it may appear within a few years. It will primarily allow for more natural control of dashboard features and retrieval of information such as directions. “In the navigation domain, we’re developing methods to describe points of interest more abstractly,” he says. “I don’t always know the exact address of where I want to go. I want to be able to say ‘I want to go to a restaurant in the marina near the ballpark.’”

Nuance came to dominate the market for voice-recognition technology over the past decade after acquiring various other companies in that space (see “Where Speech Recognition Is Going”). Thanks to new techniques and large quantities of training data, speech recognition has improved greatly over that time, and Nuance supplies the technology to companies across numerous industries. It already provides voice control technology to carmakers including Ford, Hyundai, and Chrysler.

Nuance is now looking to build on that by offering greater understanding of speech. This is notoriously difficult, though, because the meaning of words and sentences can vary dramatically with context, so dialogue usually has to be carefully constrained to certain subject areas. Conducting more complex conversations is a major goal for the lab Ortiz runs at Nuance. His team is working to develop personal assistants that can understand more types of sentences and respond effectively when they do not understand a request. For instance, a query might refer to a previous discussion and require a subtle appreciation of its context. A user might ask such a system how a particular restaurant compares, in terms of user rating, to other restaurants in his or her search history.

And Ortiz believes that more fluent speech technology could be just around the corner, thanks to advances in parsing semantics. “The stars are aligning at just the right time,” he says. “There have been a lot of advances in various components—language-understanding and the reasoning back-end parts. One big challenge is to put these pieces together.”

Another key challenge, as far as the auto industry is concerned, is ensuring that more sophisticated interfaces aren’t also more distracting. More intuitive speech interfaces might be less taxing, but only if they work well.

“If it works perfectly, great. If it fails, you’re in a worse position,” says Bryan Reimer, a scientist at MIT’s Age Lab, whose research has shown that voice interfaces can be just as distracting as regular ones in cars. “The more complex and vague the commands, the more complex the recognition problem, and the higher damage of failure.”

Several carmakers contacted by MIT Technology Review declined to discuss how voice technology would likely evolve in their products. However, vehicle interfaces are advancing at an impressive pace, spurred on in part by mobile technology (see “Rebooting the Automobile”).

