Web

The Search for Voice Activation

Google's voice interface patent gives life to rumors that voice-actived mobile search will soon be a reality.

  • Friday, April 21, 2006
  • By Kate Greene

Google is mum on how -- if at all -- it plans to use its recently granted patent for a voice-enabled search engine -- despite the fact that it has also hired several speech-recognition researchers.

Originally filed in February 2001, the patent was granted for "Voice interface for a search engine," in a move that likely signals some level of development at Google on voice technology for searching the Web using handhelds. Further fueling speculation is Google's poaching of several speech-recognition specialists -- the kind of move that often signals that a new product is afoot.

Advertisement

"They've put together a very strong group of people who are experts in speech-recognition technology," says Nelson Morgan, director of the International Computer Science Institute, which is affiliated with the University of California at Berkeley. He explains that Google picked up a number of engineers from Nuance, "the big gun in speech-recognition technology," which recently lost researchers after it was purchased by Scansoft, a speech-technology company (the combined company is called Nuance).

Investing in voice-search technology is a prescient move by Google, since the mobile Web appears to be at a tipping point. Globally, 28 percent of all mobile-phone users now surf the Web (up three percent over last year), according to Ipsos Insight, a Paris-based market research firm. In the United States, three out of every four homes now have mobile phones, and increasingly those phones are being used for more than just talking. In 2005, over half (52 percent) of all mobile-phone users had either sent or received a text message and 37 percent had sent or received an e-mail message.

Despite this growth, though, searching the Internet on handhelds can be maddening because of tiny keyboards and unfriendly designs, which require, at best, a stylus or, at worst, a series of clicks through numerous menu screens. Speech recognition software may provide a better way. And a handful of companies are exploring the technology. San Diego's V-Enable offers voice search services to help users find ring-tone listings, and Promptu of Menlo Park, CA plans to offer similar services this summer.

At their most fundamental, speech recognition systems are composed of three major functions. First, words are captured and translated into a digital signal. Then a speech-recognition algorithm compares those signals to words and phrases from a pre-set dictionary, of, say, ring-tone options or movie listings. Finally, the software offers the most likely match for the spoken phrase.

Print

Related Articles

Can Google Get Web Users Talking?

Voice-driven search is a futuristic idea, and may take some getting used to.

How Google's Ear Hears

The new voice-search application for the iPhone marks a milestone for spoken interfaces.

Talk to the Phone

Speech-recognition software from Vlingo could make the mobile Web easier to use.

Close Comments

To comment, please sign in or register

Forgot my password

Guest (Suchit)

  • 2125 Days Ago
  • 04/21/2006

Super Concept

Certainly this could get big in terms of "Local Search" and maps based applications. Where people can query by voice about a particular thing they are looking for somewhere and they get back detailed instructions, description or whatever the information with power of Google search engine.

http://www.Suchit-Tiwari.Org

Reply

Guest (Bibi Kaur)

  • 2125 Days Ago
  • 04/21/2006

Accent

Both NOISE POLLUTION [New] LAWS and/or Lack of laws and Accents will be in the way
However, I keenly await the day when I can use this life saving technology. Go Google!

Reply

Guest (Sastri)

  • 2125 Days Ago
  • 04/21/2006

Hurdles

As anyone who has used voice activated phone system knows - this technology looks good but will be hard to implement. It has a frustration factor that will be hard to overcome. No one I know has the patience to parse the voice options to get to a result. Doing a search on a computer is different because you see a screenful of results from which you can pick quickly what you want or move on.

Reply

Guest (Brian Ty Graham)

  • 2125 Days Ago
  • 04/21/2006

Download Voice Tools from Voice Web Solutions

Speech activate web pages at VoiceWebSolutions.net

Reply

Guest (Bewulf)

  • 2118 Days Ago
  • 04/28/2006

Voice Activation

Human Speach is a series of tones at given frequency and amplitudes.  Certain residual background noises emulate what sounds like conversation from a distance.  It is suprising how close sounding to speach noise sources can become.  As with anything it is how one acclimates to the tonal quality that brings understanding.  Think of a bus station loudspeaker system, no one can understand what is broadcast when first exposed.  After a bit of time, however, an individual will atune to the to the altered frequency and modulation and begin to discriminate words.  Look further at how humans adapt to learn understand what machines must do to achieve learning.

Reply

Advertisement

MAGAZINE

Can We Build Tomorrow's Breakthroughs?

Manufacturing in the United States is in trouble. That's bad news not just for the country's economy but for the future of innovation.

Videos

A Social-Media Decoder

More

Advertisement

Technology Review Lists

TR50

Our list of the 50 most innovative companies, including the following:

Goldwind Science and Technology

Nissan

Applied Materials

Crowdcast

More

Advertisement

Facebook

Advertisement