Technology Review - Published By MIT
Advertisement

Searching Sportscasts

Continued from page 1

By Duncan Graham-Rowe

Thursday, June 21, 2007

smaller text tool iconmedium text tool iconlarger text tool icon


Once a new video clip is encoded using such patterns, the system looks for co-occurrences between the matched patterns and phrases. "In this way, the system is able to find correlations with events in the game, without requiring a human to explicitly design representations for any specific events," says Fleischman.

Giving precise figures on the accuracy of the system is difficult because there is no standard for judging. Even so, trials carried out by Fleischman and Roy involving searching six baseball games for occurrences of home runs showed promise. Using just visual search alone yielded poor results, as was the case using just speech. "However, when you combine the two sources of information, we have seen results that nearly double the performance of either one on their own," says Fleischman.

The researchers are now looking to extend this system to other sport-video archives, such as for basketball. But it shouldn't just benefit sports fans, says Fleischman.

In theory, the system could help with other video-search processes, such as security-video analysis, says David Hogg, a professor of computer science and head of the Vision Group at Leeds University, in the United Kingdom. This system is a very novel approach, he says, and one that shows the way forward for the unsupervised learning systems that are needed to make this kind of search automatic.

Using speech and visual information together is a powerful combination for machine learning, Hogg says. "In machine learning, it is very likely to be easier the more information there is available about each situation."

Speech can help remove ambiguities in visual data, and visual data can help disambiguate speech, says Richard Stern, a professor of electrical and computer engineering at Carnegie Mellon University, in Pittsburgh. It's a natural marriage, he says, but one that's just beginning to emerge.

Until recently, there has been relatively little use of ASR to aid in search, says Stern. "But this is all changing very rapidly," he says. "Google has been recruiting speech scientists aggressively for the past several years--another indication that multimedia search is moving from the research lab to the consumer very rapidly."

Comments

Log In

Forgot your password?     Register »
Advertisement

Videos

Making 3D Maps on the Move
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
Featured Content
Sponsored by:
White Papers

Twelve ways to reduce costs with SQL Server 2008
Find out how to reduce costs and get more efficient

Download

Total Economic Impact of SQL Server 2008 Upgrade
Forrester reports on increasing productivity and management capabilities

Download 

Achieving Cost and Resource Savings with UC
How Office Communications Server R2 and Exchange Server can make your business smarter and more efficient

Download 

The Compelling Case for Conferencing
Read how you can improve workload support and find IT efficiencies

Download

How Windows Server 2008 R2 Helps Optimize IT and Save you Money
Read how you can improve workload support and find IT efficiencies

Download

Windows Server 2008 R2 Hyper-V Live Migration
See how Windows Server 2008 R2 and Hyper-V enable virtualization and Live Migration

Download
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.