Technology Review - Published By MIT
Advertisement

Improved Visual Search

Researchers are trying to make computers see as we do.

By Neil Savage

Thursday, May 25, 2006

smaller text tool iconmedium text tool iconlarger text tool icon

Search engines work wonderfully when you want to find something in a long stretch of text. Just type in a word or phrase, and the computer quickly scans through a Web page or Word document and picks it out. But for a computer to do the same thing with an image -- find a particular person or object somewhere in a video recording, for instance -- is much more difficult. Whereas a human eye instantly distinguishes a tree from a cat, it's a lot of work to teach a computer to do the same.

That challenge is being tackled by researchers at MIT's Center for Biological and Computational Learning (CBCL), led by Tomaso Poggio, the Eugene McDermott Professor in the Brain Sciences and Human Behavior. Some students at the center are proposing software that could work, say, with surveillance cameras in an office building or military base, eliminating the need for a human to watch monitors or review videotapes. Other applications might automate computer editing of home movies, or sort and retrieve photos from a vast database of images. It might also be possible to train a computer to perform preliminary medical diagnoses based on an MRI or CT scan image.

But the work to make such exciting applications possible is daunting. "The fact that it seems so easy to do for a human is part of our greatest illusion," says Stanley Bileschi, who this month earned his PhD in electrical engineering and computer science at the CBCL. Processing visual data is computationally complex, he says, noting that people use about 40 percent of their brains just on that task. There are many variables to take into account when identifying an object: color, lighting, spatial orientation, distance, and texture. And vision both stimulates and is influenced by other brain functions, such as memory and reasoning, which are not fully understood. "Evolution has spent four billion years developing vision," Poggio says.

Scientists have traditionally used statistical learning systems to teach computers to recognize objects. In such systems, a scientist tells a machine that certain images are faces, then tells it that certain other images are not faces. The computer examines the images pixel by pixel to figure out, statistically, what the face images have in common that the nonface images do not.

For instance, it might learn that a set of pixels representing the brow is a brighter than the pixels representing the pupils, and that the two sets are a standard distance apart. It might notice that the mouth tends to be horizontal, and that there is a sharp change in brightness where the head stops and the background begins. Once trained, it can look at new images and see how closely they match the rules.

Comments

  • to think that our complexity and the billions
    of other creatures all living on
    a tiny planet is an accident of a few billion years is your first mistake. God created the world looking billions in order to test man and give him free choice.repent before it is too late. peak oil and global warming should be a good enough warning.
    Rate this comment: 12345
    Guest (dav)
    05/25/2006
    Posts:1
    • sure enough
      there is an old saying: if our brains were simple enough for us to comprehend them--we would be too simple to care. Although it looks simple in sci-fi, real robots are and sentient computers are massively far from us. Of course, that is why we keep researching.
      Rate this comment: 12345
      Guest (kitk)
      05/26/2006
      Posts:1
    • Science, science...
      Hi dav,

      Don't loose the point of scietific research. We assume the Universe (so as we) to some extent is understandable. That means it has rules and we are able to recognize them. That works with computer vision too: we want to recreate what nature has to present days.

      Some reading on molecular biology, cosmology and genetics may clear up your mind about evolution and creation.

      Thanks.
      Rate this comment: 12345
      Guest (Santiago)
      05/26/2006
      Posts:1

Log In

Forgot your password?     Register »
Advertisement

Videos

Making 3D Maps on the Move
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
Featured Content
Sponsored by:
White Papers

Twelve ways to reduce costs with SQL Server 2008
Find out how to reduce costs and get more efficient

Download

Total Economic Impact of SQL Server 2008 Upgrade
Forrester reports on increasing productivity and management capabilities

Download 

Achieving Cost and Resource Savings with UC
How Office Communications Server R2 and Exchange Server can make your business smarter and more efficient

Download 

The Compelling Case for Conferencing
Read how you can improve workload support and find IT efficiencies

Download

How Windows Server 2008 R2 Helps Optimize IT and Save you Money
Read how you can improve workload support and find IT efficiencies

Download

Windows Server 2008 R2 Hyper-V Live Migration
See how Windows Server 2008 R2 and Hyper-V enable virtualization and Live Migration

Download
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.