Technology Review - Published By MIT

The Grammar of Sound

New software lets you index and search audio much faster than in the past.

By John Harney

April 30, 2003


Imagine you were a transcriptionist at the federal government's trial of Microsoft last year. Say you were trying to find every instance in which Bill Gates testified between May 15 and June 1. Using existing tools like full-text search engines, natural-language query, or speech recognition, you'd have to transcribe the audio into a text file, then index it with a lexicon of terms that included "Gates." Such an undertaking would have been labor-intensive, time-consuming, and error-prone. But only then could congressmen quickly locate the testimony in which they were interested.

The key to expediting the process is eliminating the need for transcription, indexing, or both, which has long appeared to be an insoluble problem. But Fast-Talk Communications, a company spun out of Georgia Tech, has created a way for users to locate subject matter in an actual audio file simply by phonetically spelling and entering any term they want to find.

Say, for example, that you want to locate the word "Sudetenland" in an audio account of events leading up to World War II. According to Mark Clements, co-founder of the Atlanta-based company, you'd simply "sound out what Sudetenland sounds like. Take the name, 'Sue,' the city, 'Dayton,' and the word, 'land,' and string those together, type it in. That gets resolved into the set of phonemes you're looking for." (Phonemes are the basic units of sound from which all the spoken words of a language are built.) The Fast-Talk software finds the string of phonemes that corresponds to the letters you enter and guides you to all spoken references to Sudetenland in the audio file. Because this tool bypasses the whole transcription and indexing process, it delivers results fast. According to Clements, the system processes "on the order of 30 hours of material per second."
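
To make the idea concrete, here is a minimal sketch in Python of what a phonetic lookup of this kind might look like. It is not Fast-Talk's algorithm, which the article does not describe in detail. The pronunciation table, the timestamped phoneme track, and the function names are all hypothetical, and the matching rule (a sliding window that tolerates one substituted phoneme) is simply one plausible way to absorb small recognition errors.

```python
# Toy illustration of phonetic search. Everything here is an assumption
# made for the example; it is not Fast-Talk's data or method.

# Hypothetical pronunciation table using ARPAbet-style phoneme symbols.
SOUND_OUT = {
    "sue": ["S", "UW"],
    "dayton": ["D", "EY", "T", "AH", "N"],
    "land": ["L", "AE", "N", "D"],
}

# Hypothetical phoneme track for the spoken phrase "the Sudetenland crisis",
# stored as (phoneme, start time in seconds) pairs.
TRACK = [
    ("DH", 0.0), ("AH", 0.1),
    ("S", 0.2), ("UW", 0.3), ("D", 0.4), ("EY", 0.5), ("T", 0.6),
    ("AH", 0.7), ("N", 0.8), ("L", 0.9), ("AE", 1.0), ("N", 1.1), ("D", 1.2),
    ("K", 1.3), ("R", 1.4), ("AY", 1.5), ("S", 1.6), ("IH", 1.7), ("S", 1.8),
]


def to_phonemes(query):
    """Resolve a sounded-out query such as 'Sue Dayton land' into phonemes."""
    phonemes = []
    for word in query.lower().split():
        phonemes.extend(SOUND_OUT[word])
    return phonemes


def count_mismatches(a, b):
    """Count positions at which two equal-length phoneme lists differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


def search(track, query_phonemes, max_mismatches=1):
    """Return the start time of every window of the track that matches the
    query with at most max_mismatches substituted phonemes."""
    n = len(query_phonemes)
    hits = []
    for i in range(len(track) - n + 1):
        window = [phoneme for phoneme, _ in track[i:i + n]]
        if count_mismatches(window, query_phonemes) <= max_mismatches:
            hits.append(track[i][1])
    return hits


if __name__ == "__main__":
    query = to_phonemes("Sue Dayton land")
    print(search(TRACK, query))  # prints [0.2]
```

In this sketch the audio is converted to a phoneme track once, and each query is then only a comparison of short symbol sequences. That is one way to see why a search of this kind needs neither a transcript nor a word index.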

This is important, says Dan Rasmus, an analyst at the market research firm Giga/Forrester, because "voice is one of those untapped resources that companies have." Jackie Fenn, who follows emerging technologies at Gartner, contends that Fast-Talk's "main value is in tapping into audio streams that you probably wouldn't really be able to get access to" otherwise. "It's not cost-effective to have a human do that," Fenn says.
