Technology Review - Published By MIT
Advertisement

How Google's Ear Hears

The new voice-search application for the iPhone marks a milestone for spoken interfaces.

By Kate Greene

Thursday, November 20, 2008

smaller text tool iconmedium text tool iconlarger text tool icon

If you own an iPhone, you can now be part of one of the most ambitious speech-recognition experiments ever launched. On Monday, Google announced that it had added voice search to its iPhone mobile application, allowing people to speak search terms into their phones and view the results on the screen.

Credit: Technology Review
Multimedia
video  Technology Review tests Google's voice search.

In designing the system, Google took on an enormous challenge. Where an automated airline reservation system, say, has to handle a relatively limited number of terms, a Web search engine must contend with any topic that anyone might ever want to research--literally.

Fortunately, Google also has a huge amount of data on how people use search, and it was able to use that to train its algorithms. If the system has trouble interpreting one word in a query, for instance, it can fall back on data about which terms are frequently grouped together.

Google also had a useful set of data correlating speech samples with written words, culled from its free directory service, Goog411. People call the service and say the name of a city and state, and then say the name of a business or category. According to Mike Cohen, a Google research scientist, voice samples from this service were the main source of acoustic data for training the system.

But the data that Google used to build the system pales in comparison to the data that it now has the chance to collect. "The nice thing about this application is that Google will collect all this speech data," says Jim Glass, a principal research scientist at MIT. "And by getting all this data, they will improve their recognizer even more."

Mobile phones are assuming more and more computational duties; in much of the world, they're people's only computers. But their small screens and awkward keyboards can make text-intensive actions, like Web search, frustrating. While mobile browsers are getting better at predicting your search terms, and thereby reducing the amount of typing, nothing is quite as easy as speaking directly into the phone.

Story continues below

Speech-recognition systems, however, remain far from perfect. And people's frustration skyrockets when they can't find their way out of a voice-menu maze. But Google's implementation of speech recognition deftly sidesteps some of the technology's shortcomings, says Glass.

"The beauty of search engines is that they don't have to be exactly right," he says. When a user submits a spoken query, he says, Google's algorithms "just take it and stick it in a search engine, which puts the onus on the user to select the right result or try again." Because people are already used to refining their queries as they conduct Web searches, Glass says, they're more tolerant of imperfect results.

Comments

  • Google Ears
    This is just code. Couldn't it be dropped onto laptops? I'm disabled. If I could speak to my computer it would help significantly.

    dib
    Rate this comment: 12345

    dib
    11/20/2008
    Posts:9
    Avg Rating:
    2/5
    • Re: Google Ears
      If you are disabled, you should definitely look at tazti speech recognition (http://www.tazti.com).  It does a lot things that are very helpful, including searching not just Google, but Yahoo, eBay, etc.  Also, it's free which you cannot beat!  They have a video which explains how it works. 
      Rate this comment: 12345

      adamcomputes
      11/21/2008
      Posts:1
      Avg Rating:
      1/5

  • Daniel Tunke...
    11/20/2008
    Posts:5
    Avg Rating:
    4/5
  • iPhones Speech Recognition App
    I have done my A-levels about 7 years ago and I know there were speech recognition systems for special people.

    @Google Ears: i have checked the one you mentioned, it seems to be a very good speech recognition computer software for disabled people.

    Speech recognition system for iPhone will be a blast because iPhone is almost a handy computer withh the access to the world though Wi-Fi or Edge, connecting with people, send messages, business, fun, entertainment and a lot more. If the speech recognition system for iPhone works good then iPhone as s product will be recommend to handy cap people and this will open new horizons for business of developing iphone accessories for special/hand caped people.
    Rate this comment: 12345

    ronnie.willi...
    10/07/2009
    Posts:4

Log In

Forgot your password?     Register »
Advertisement

Videos

Making 3D Maps on the Move
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
Featured Content
Sponsored by:
White Papers

Twelve ways to reduce costs with SQL Server 2008
Find out how to reduce costs and get more efficient

Download

Total Economic Impact of SQL Server 2008 Upgrade
Forrester reports on increasing productivity and management capabilities

Download 

Achieving Cost and Resource Savings with UC
How Office Communications Server R2 and Exchange Server can make your business smarter and more efficient

Download 

The Compelling Case for Conferencing
Read how you can improve workload support and find IT efficiencies

Download

How Windows Server 2008 R2 Helps Optimize IT and Save you Money
Read how you can improve workload support and find IT efficiencies

Download

Windows Server 2008 R2 Hyper-V Live Migration
See how Windows Server 2008 R2 and Hyper-V enable virtualization and Live Migration

Download
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.