Technology Review - Published By MIT
Advertisement
[1] 2 Next »

October 2005

Translation by Numbers

Language Weaver's machine-translation software aids homeland security.

By Gregory T. Huang

smaller text tool iconmedium text tool iconlarger text tool icon

Bryce Benjamin knew a winner when he saw one. It was December 2001, and the infotech entrepreneur was meeting with two professors who were starting a company to commercialize "statistical machine translation." Their breakthrough: software that could learn automatically to translate text from one language into another.

Benjamin believed the technology was desperately needed for purposes of both homeland security and business communication. The company's location -- on the water in sunny Marina del Rey, CA -- didn't hurt either. "I looked out at the view," he says, "and I thought, 'This deal has a lot of promise.'"

Today the trio's 35-person startup, Language Weaver, is one of the leading companies in the burgeoning field of machine translation. For U.S. counterterror translators facing a growing backlog of untranslated audiotapes and communiques, software is increasingly the weapon of choice. Multinational corporations like Google, Yahoo, and Microsoft -- not to mention smaller companies with global staff -- are also driving the demand for machine translation of technical documents and Web pages.

Language Weaver's software translates text between English and half a dozen other languages, including Arabic, Chinese, and Spanish. So far, the technology is most useful as a screening tool that monitors reams of foreign-language news broadcasts, chat rooms, and websites. "People use our translation software to determine the relevance of information, as a triage function," says Benjamin, the company's CEO. "It's very good at telling what a certain passage is about."

Most machine-translation systems work on individual words or use complicated sets of translation guidelines, which must be devised by linguists and coded by hand. Language Weaver's technology, which company cofounders Kevin Knight and Daniel Marcu developed at the University of Southern California's Information Sciences Institute (ISI), takes a different tack. It uses human translation data, such as United Nations transcripts, to set up "parallel corpora" of text passages in two languages, aligned sentence by sentence.

[1] 2 Next »
October 2005

Would you like to read more articles from the October 2005 issue?

This article is from the October 2005 Issue of Technology Review. To read other articles from this issue simply register for My.TechnologyReview.com. It's free.

Subscribe today and save up to 41% »

Comments

Advertisement

Current Issue

Technology Review January/February 2009
Lifeline for Renewable Power
Without a radically expanded and smarter electrical grid, wind and solar will remain niche power sources.
•  Subscribe
Save 41%
•  Table of Contents
•  MIT News

Magazine Services

Career Resources

MIT Technology Insider

Stories and breaking news from inside MIT about the latest research, innovations, and startups--in a convenient monthly e-newsletter. Subscribe today
Advertisement

Follow us on Twitter

Twitter

Get Technology Review updates via the web, cellphone, or Instant Messager – Follow techreview on Twitter!

Advertisement

More Technology News from Forbes

Advertisement
Advertisement
TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology