Technology Review - Published By MIT
Log in to My.TechnologyReview.com | Register
Advertisement

Friday, June 22, 2007

Human-Aided Computing

Microsoft researchers are trying to harness untapped brain power.

By Kate Greene

smaller text tool iconmedium text tool iconlarger text tool icon
Brain drain: The top picture is an artist’s rendering of subconscious computing, which, using EEG, accesses the processing power of the human brain for tasks--such as face recognition--that are difficult for machines. The bottom picture is a layout of the placement of EEG connections on the head.
Credit: Desnye Tan and Pradeep Shenoy of Microsoft Research

Despite all the power of computers, they are still lousy at certain simple tasks, such as recognizing faces and knowing the difference between a table and a cow. Now researchers at Microsoft are trying to tap into some of the specialized--and often subconscious--computing power in the human brain, and use it to solve problems that have so far been intractable for machines.

Desney Tan, a researcher at Microsoft Research, and Pradeep Shenoy, a graduate student at the University of Washington, have devised a scheme that uses electro-encephalograph (EEG) caps to collect the brain activity of people looking at pictures of faces and nonfaces, such as horses, cars, and landscapes. The pair found that even when the subjects' objective wasn't to distinguish the faces from the nonfaces, their brain activity indicated that they subconsciously identified the difference. The researchers wrote software that churns through the EEG data and classifies faces and nonfaces based on the subjects' response. When a single person viewed an image once, the system was able to identify faces with up to 72.5 percent accuracy. Results were even better using data from eight people who had viewed a particular image twice: accuracy jumped to 98 percent.

"Given that the brain is constantly processing external information," says Tan, "we can start to use the brain as a processor." In one scenario, he explains, pictures would be placed in people's peripheral vision, which doesn't require focused cognitive attention, so they could go about their daily tasks.

Today it takes relatively large supercomputers many hours to recognize faces--something a human can do almost instantly. One application for this face-recognition technique could be to use it for quickly sorting snapshots from surveillance videos to find frames with faces and those without, although Tan says this early work is mainly a proof of concept.

In addition to finding faces, Tan says, there is evidence that the strategy could be useful for identifying other types of objects, such as dogs or cats, and different types of words. Subconscious brain power could therefore improve automated image search by preclassifying objects to help a computer more accurately identify pictures.

It's not a new idea to use human brain power to supplement the abilities of computers, but most of this information is consciously provided by a person. For instance, Google's Image Labeler game lets people rack up points for identifying specific objects in pictures; the information is used to train machines to better classify pictures. But subconscious computing is a nascent field. "There are a bunch of ethical considerations before any of this can be taken to the mass market," Tan says. For example, how distracting would it be to have pictures flash in a person's peripheral vision?

"I think it's a pretty cool idea that has a lot of potential," says Luis von Ahn, a professor of computer science at Carnegie Mellon University, in Pittsburgh. However, he admits that quite a few people might have problems with the notion of their subconscious responses being recorded. "It's kind of freaky," he says.

Comments

  • Visual vs textual
    matthijs on 06/25/2007 at 4:27 AM
    Posts:
    1
    This is a good approach to image recognition and retrieval .. As I described in my thesis (http://photoindex.thingsdesigner.com/pdf/photoindex_mathesis_matthijsrouw.pdf), tagging with words comes with a lot of problems. It is kind of a bug fix to the semantic gap problem. I still feel that communicating about (e.g. indexing)  visual material should done withing the visual domain, not the textual domain. They are two completely different languages.
    Rate this comment: 12345
Advertisement

Current Issue

Technology Review September/October 2008
How Obama Really Did It
Social technology helped bring him to the brink of the presidency.
•  Subscribe
Save 41%
•  Table of Contents
•  MIT News

Magazine Services

Career Resources

MIT Technology Insider

Stories and breaking news from inside MIT about the latest research, innovations, and startups--in a convenient monthly e-newsletter. Subscribe today

Follow us on Twitter

Twitter

Get Technology Review updates via the web, cellphone, or Instant Messager – Follow techreview on Twitter!

Advertisement

More Technology News from Forbes

Advertisement
Advertisement
Advertisement
TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology