Technology Review - Published By MIT
Log in to My.TechnologyReview.com | Register
Advertisement

Friday, April 27, 2007

A New Dimension for Your Photos

Continued from page 1

By Wade Roush

smaller text tool iconmedium text tool iconlarger text tool icon

In fact, parallax isn't strictly required for 3-D vision: if you shut one eye, the world doesn't go flat. The brain infers depth using all sorts of cues such as shading, color, motion, and our learned experience about the spatial relationships between floors and walls, or between streets and buildings. "It turns out that using a fairly simple model--thinking of the world in terms of a ground surface, vertical surfaces that stick up out of it, and the sky--you can create pretty compelling 3-D models," says Hoiem.

The software that he, Efros, and Hebert developed starts converting an image by trying to group each pixel in a two-dimensional image into one of these classes. Sky is usually the easiest--it's blue or white. The top and bottom edges of most photos are aligned with the horizon, which helps the software identify the ground plane. And the windows of a multistory building are often arranged in parallel lines with a common vanishing point--a strong indication of a vertical surface.

But Hoiem didn't explicitly teach the software these rules. The system is based on machine-learning algorithms, meaning that it figures out its own rules of thumb by recognizing statistical patterns in hundreds of images in which the ground, sky, and vertical surfaces have been prelabeled by humans.

"We didn't have to start completely from scratch, fortunately," says Hoiem. "There's been a lot of work on how we represent color and texture and structure. There is an existing algorithm for recognizing the vanishing point of a group of lines. And people have worked a lot on recognizing objects like people or cars. But nobody had thought that maybe you can combine all of these and learn to recognize the actual geometry of a scene."

Once Fotowoosh has identified the major surfaces in a scene, it joins them into a 3-D model using the Virtual Reality Markup Language file format, or VRML. The software peels off parts of the two-dimensional image and pastes them onto the appropriate surfaces in the model, a process called texture mapping.

Currently, the finished models can only be viewed inside a Web browser equipped with a special extension for viewing VRML files. But in the beta version of Fotowoosh, due next month, the models will be displayed using the more common Flash format already included in most browsers, according to Pishevar. (The Fotowoosh home page includes a video demonstrating the end product for several sample images.)

Right now, the system isn't very good at separating discrete objects that should be in the foreground, such as pedestrians in a street scene, from background surfaces, such as walls. But Hoiem is working on that. "In a year or possibly less, you'll be able to take a photo of an alley with all sorts of cars and people, and create a 3-D model where those are all seen as separate 3-D foreground objects," he says.

Comments

  • Maybe another application
    briang1621 on 04/30/2007 at 8:41 PM
    Posts:
    31
    Avg Rating:
    1/5
         Obviously a talent artist can create a full 3D clay model from a single picture; however, with this recent invention highlighted in the article “A New Dimension for Your Photos” allows computers to now recreate a similar 3D environment from a single picture. Sadly, that is without the clay.
          It seems that the applications that they have mentioned in this article for the software like ‘3D flicker websites, pasting 3D pictures into Google Earth or Third-Life’ are more entertainment based than actually profitable. Maybe they should step back and examine the markets for digital photo editing software and 3D rendering software. The owners of programs like Photoshop, Illustrate, and Lightwave 3D may be more interest in using this software to make advertisements or illustrations more quickly or more life-like. Regardless I still think the technology is cool and really like the video clips that were linked to the technologyreview.com
       Thank you
    Brian Glassman
    Pembroke Pines, Florida

    www.techrd.com
    Rate this comment: 12345
    • Re: Maybe another application
      evolvingwheel on 05/02/2007 at 9:00 PM
      Posts:
      5
      I was wondering, will 3D rendition be able to change the world of advertising in web media? if the browser is capable with tools to render images in 3D aspect, and you observe all bunch of stuff popping out here and there from the background, won't that be intense? or may be not... considering the fact that in real world we do see them around us. It just a change in experience pattern so far. Now will that rendition generate more revenue? That I don't know. Another question arises - how well the software will be able to extrapolate information around the edges to create the depth perception across different demarkations and how much training will confirm a close to real picture layout?

      http://innovech.wordpress.com
      Rate this comment: 12345
  • Quakr
    hardbutnot on 07/30/2007 at 8:25 AM
    Posts:
    1
    I wonder if you've seen the Quakr project - an attempt to build a 3d world by "simply" placing Flickr images in their 3d position.  It's very much related to this post.  Go have a play! http://www.quakr.co.uk/
    Rate this comment: 12345
Advertisement

Current Issue

Technology Review May/June 2008
An Electrifying Startup
A new lithium-ion battery from A123 Systems could help electric cars and hybrids come to dominate the roads.
•  Subscribe
Save 41%
•  Table of Contents
•  MIT News

Magazine Services

Career Resources

MIT Technology Insider

Stories and breaking news from inside MIT about the latest research, innovations, and startups--in a convenient monthly e-newsletter. Subscribe today
Advertisement

More Technology News from Forbes

Advertisement
Advertisement
Advertisement
TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology