Technology Review - Published By MIT
Advertisement

The Grammar of Sound

Continued from page 1

By John Harney

April 30, 2003

smaller text tool iconmedium text tool iconlarger text tool icon

Speed is not the only advantage that the Fast-Talk method has over alternative methods. With speech recognition, says Clements, "your goal is to take audio that's input speech and either recognize what was spoken according to some very constrained grammar, or you can use a natural language approach and try to find the sequence of words that is most probable." If the term is not in a system's spoken lexicon, however, or if you are uncertain about the sequence of words in text form you're looking for, the search may prove fruitless. By contrast, Clements says, the Fast-Talk approach "processes the speech in such a way that you can later go back and search it very efficiently for any set of sounds-they don't have to have any lexical existence at all."

Fenn believes this is a much more versatile approach. Fast-Talk, she says, is "focusing on the pure audio aspect" of spoken communications, which she says "seems to give them a faster algorithm and greater flexibility in dealing with new items that aren't in the vocabulary." Fast-Talk also handles accents well. The key is having the software practice recognizing phonemes in the accent with which you'll be dealing. For instance, a system trained by a speaker from Canada would transcribe the sound "hoos" into the word "house."

Fast-Talk's software does not, however, employ certain strategies that natural language queries might-leading to some shortcomings. Proximity searches-for instance, where a natural language tool would recognize a word like "Georgia," because it usually occurs right after the word, "Atlanta"-are not possible. Also, since Fast-Talk recognizes by sound, not spelling, it cannot distinguish homonyms. "We would not be able to tell the difference between the word, discreet' as in cautious versus discrete' as in individual items," says Clements. Another disadvantage, cautions Fenn, is that the system is not amenable to text mining. "If you wanted to look for patterns and clusters of related concepts, you couldn't. You'd have to have a transcript."

Those qualifications aside, however, the technology appears to offer significant benefit in several applications. Television and radio networks have thousands of hours of programming but no fast way to index and negotiate them. "If you want to find where, say, an NPR news account talked about a panda," says Clements, "it just takes forever to do that right now." Another potentially hot use is in call centers. Rasmus says "call centers want to know if anyone had a conversation about X kind of product. Looking at those voice recordings as a means for getting information to somebody at a call center who's trying to help a client is incredibly time saving, he says. Office workers might ultimately find the technology useful too, Clements says. "Imagine that you have all your voicemail as audio files interleaved with all of your e-mail," he says. "Our tool would make it so you could manage it."

Comments

Log In

Forgot your password?     Register »
Advertisement

Videos

Making 3D Maps on the Move
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
Featured Content
Sponsored by:
White Papers

Twelve ways to reduce costs with SQL Server 2008
Find out how to reduce costs and get more efficient

Download

Total Economic Impact of SQL Server 2008 Upgrade
Forrester reports on increasing productivity and management capabilities

Download 

Achieving Cost and Resource Savings with UC
How Office Communications Server R2 and Exchange Server can make your business smarter and more efficient

Download 

The Compelling Case for Conferencing
Read how you can improve workload support and find IT efficiencies

Download

How Windows Server 2008 R2 Helps Optimize IT and Save you Money
Read how you can improve workload support and find IT efficiencies

Download

Windows Server 2008 R2 Hyper-V Live Migration
See how Windows Server 2008 R2 and Hyper-V enable virtualization and Live Migration

Download
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.