Web

The Search for Voice Activation

(Page 2 of 2)

  • Friday, April 21, 2006
  • By Kate Greene

Voice-activated systems, while getting better, can still sometimes get the answers wrong, explains Morgan. The software matches the voice input to a number of possibilities and often asks the user if the system's highest-ranked word was the one he or she intended, or it might say the word was unintelligible. These systems work best when the algorithm needs to access only a small dictionary, he adds, such as ring-tone options, movie listings, or phone-book contacts.

That methodology, though, would be impractical for Web searches, since it's impossible to limit the search dictionary while providing access to the eight billion-plus websites that Google searches.

Google's voice search patent approaches the problem by taking a step back from simply plugging standard voice-recognition technology into standard search technology, says Morgan. Instead of trying to accurately predict the single-best guess about what a person is saying, the technology would take a handful of word and phrase possibilities and throw them at the powerful Google search engine. In this way, the voice search system may not need the most accurate speech recognition technology. Instead, it relies on Google's strength -- its search algorithm -- to supply the most likely result for a number of possibilities.

This strategy of outsourcing the translation and searching to remote Google servers has its benefits, says Jordan Cohen, senior scientist at SRI International in Palo Alto, CA. It reduces the level of complexity in the software on the phone, and therefore wouldn't use as much processing power, memory, or energy. Instead, it relies on the strength of the network -- and some faith that the technology would be able to deal with the uncertainty of both vocal input and an intended search. If speech technology is instead confined to the mobile device, Cohen says, the software can "count on the person to fix things up" when the algorithm can't find the most suitable word.

Because mobile voice search has barely begun to take form, it remains an open question whether Google's approach will be employed -- or produce the best product. But the market for mobile search in general is "going to be immense," says Cohen. "The Google patent is an attempt to stake out a claim in that space."

Print

Related Articles

Can Google Get Web Users Talking?

Voice-driven search is a futuristic idea, and may take some getting used to.

How Google's Ear Hears

The new voice-search application for the iPhone marks a milestone for spoken interfaces.

Talk to the Phone

Speech-recognition software from Vlingo could make the mobile Web easier to use.

Close Comments

To comment, please sign in or register

Forgot my password

Guest (Suchit)

  • 2125 Days Ago
  • 04/21/2006

Super Concept

Certainly this could get big in terms of "Local Search" and maps based applications. Where people can query by voice about a particular thing they are looking for somewhere and they get back detailed instructions, description or whatever the information with power of Google search engine.

http://www.Suchit-Tiwari.Org

Reply

Guest (Bibi Kaur)

  • 2125 Days Ago
  • 04/21/2006

Accent

Both NOISE POLLUTION [New] LAWS and/or Lack of laws and Accents will be in the way
However, I keenly await the day when I can use this life saving technology. Go Google!

Reply

Guest (Sastri)

  • 2125 Days Ago
  • 04/21/2006

Hurdles

As anyone who has used voice activated phone system knows - this technology looks good but will be hard to implement. It has a frustration factor that will be hard to overcome. No one I know has the patience to parse the voice options to get to a result. Doing a search on a computer is different because you see a screenful of results from which you can pick quickly what you want or move on.

Reply

Guest (Brian Ty Graham)

  • 2125 Days Ago
  • 04/21/2006

Download Voice Tools from Voice Web Solutions

Speech activate web pages at VoiceWebSolutions.net

Reply

Guest (Bewulf)

  • 2118 Days Ago
  • 04/28/2006

Voice Activation

Human Speach is a series of tones at given frequency and amplitudes.  Certain residual background noises emulate what sounds like conversation from a distance.  It is suprising how close sounding to speach noise sources can become.  As with anything it is how one acclimates to the tonal quality that brings understanding.  Think of a bus station loudspeaker system, no one can understand what is broadcast when first exposed.  After a bit of time, however, an individual will atune to the to the altered frequency and modulation and begin to discriminate words.  Look further at how humans adapt to learn understand what machines must do to achieve learning.

Reply

Advertisement

MAGAZINE

Can We Build Tomorrow's Breakthroughs?

Manufacturing in the United States is in trouble. That's bad news not just for the country's economy but for the future of innovation.

Videos

A Social-Media Decoder

More

Advertisement

Technology Review Lists

TR50

Our list of the 50 most innovative companies, including the following:

Netflix

American Superconductor

Roche

Lyric Semiconductor

More

Advertisement

Facebook

Advertisement