Technology Review - Published By MIT
Advertisement

The Search for Voice Activation

Google's voice interface patent gives life to rumors that voice-actived mobile search will soon be a reality.

By Kate Greene

Friday, April 21, 2006

smaller text tool iconmedium text tool iconlarger text tool icon

Google is mum on how -- if at all -- it plans to use its recently granted patent for a voice-enabled search engine -- despite the fact that it has also hired several speech-recognition researchers.

Originally filed in February 2001, the patent was granted for "Voice interface for a search engine," in a move that likely signals some level of development at Google on voice technology for searching the Web using handhelds. Further fueling speculation is Google's poaching of several speech-recognition specialists -- the kind of move that often signals that a new product is afoot.

"They've put together a very strong group of people who are experts in speech-recognition technology," says Nelson Morgan, director of the International Computer Science Institute, which is affiliated with the University of California at Berkeley. He explains that Google picked up a number of engineers from Nuance, "the big gun in speech-recognition technology," which recently lost researchers after it was purchased by Scansoft, a speech-technology company (the combined company is called Nuance).

Investing in voice-search technology is a prescient move by Google, since the mobile Web appears to be at a tipping point. Globally, 28 percent of all mobile-phone users now surf the Web (up three percent over last year), according to Ipsos Insight, a Paris-based market research firm. In the United States, three out of every four homes now have mobile phones, and increasingly those phones are being used for more than just talking. In 2005, over half (52 percent) of all mobile-phone users had either sent or received a text message and 37 percent had sent or received an e-mail message.

Despite this growth, though, searching the Internet on handhelds can be maddening because of tiny keyboards and unfriendly designs, which require, at best, a stylus or, at worst, a series of clicks through numerous menu screens. Speech recognition software may provide a better way. And a handful of companies are exploring the technology. San Diego's V-Enable offers voice search services to help users find ring-tone listings, and Promptu of Menlo Park, CA plans to offer similar services this summer.

At their most fundamental, speech recognition systems are composed of three major functions. First, words are captured and translated into a digital signal. Then a speech-recognition algorithm compares those signals to words and phrases from a pre-set dictionary, of, say, ring-tone options or movie listings. Finally, the software offers the most likely match for the spoken phrase.

Comments

  • Super Concept
    Certainly this could get big in terms of "Local Search" and maps based applications. Where people can query by voice about a particular thing they are looking for somewhere and they get back detailed instructions, description or whatever the information with power of Google search engine.

    http://www.Suchit-Tiwari.Org
    Rate this comment: 12345
    Guest (Suchit)
    04/21/2006
    Posts:1
    • Accent
      Both NOISE POLLUTION [New] LAWS and/or Lack of laws and Accents will be in the way
      However, I keenly await the day when I can use this life saving technology. Go Google!
      Rate this comment: 12345
      Guest (Bibi Kaur)
      04/21/2006
      Posts:1
  • Hurdles
    As anyone who has used voice activated phone system knows - this technology looks good but will be hard to implement. It has a frustration factor that will be hard to overcome. No one I know has the patience to parse the voice options to get to a result. Doing a search on a computer is different because you see a screenful of results from which you can pick quickly what you want or move on.
    Rate this comment: 12345
    Guest (Sastri)
    04/21/2006
    Posts:1
  • Download Voice Tools from Voice Web Solutions
    Speech activate web pages at VoiceWebSolutions.net
    Rate this comment: 12345
    Guest (Brian Ty Graham)
    04/21/2006
    Posts:1
  • Voice Activation
    Human Speach is a series of tones at given frequency and amplitudes.  Certain residual background noises emulate what sounds like conversation from a distance.  It is suprising how close sounding to speach noise sources can become.  As with anything it is how one acclimates to the tonal quality that brings understanding.  Think of a bus station loudspeaker system, no one can understand what is broadcast when first exposed.  After a bit of time, however, an individual will atune to the to the altered frequency and modulation and begin to discriminate words.  Look further at how humans adapt to learn understand what machines must do to achieve learning.
    Rate this comment: 12345
    Guest (Bewulf)
    04/28/2006
    Posts:1

Log In

Forgot your password?     Register »
Advertisement

Videos

Malleable Maps, Artistic Robots and Bubble Interfaces
Technology Review January/February 2010

Current Issue

Security in the Ether
Information technology's next grand challenge will be to secure the cloud--and prove we can trust it.
Advertisement
Advertisement
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2010 Technology Review. All Rights Reserved.