Technology Review - Published By MIT
Advertisement

3-D Modeling Advance

A single photo can be reconstructed into a 3-D scene with Make3D.

By Brittany Sauser

Friday, March 07, 2008

smaller text tool iconmedium text tool iconlarger text tool icon

Researchers at Stanford University have developed a Web service called Make3D that lets users turn a single two-dimensional image of an outdoor scene into an immersive 3-D model. This gives users the ability to easily create a more realistic visual representation of a photo--one that lets viewers fly around the scene.

Immersive photos: This 3-D model of the Trevi Fountain in Rome (above) was created using Make3D, a Web service that reconstructs a single photo into a 3-D visualization.
Credit: Jeff/Stanford University
Multimedia
•  See a 3-D model of a building in Amsterdam.
•  See a 3-D model of the Trevi Fountain in Rome.
•  See a 3-D model of the Pyramids.

To convert the still images into 3-D visualizations, Andrew Ng, an assistant professor of computer science, and Ashutosh Saxena, a doctoral student in computer science, developed a machine-learning algorithm that associates visual cues, such as color, texture, and size, with certain depth values based on what they have learned from studying two-dimensional photos paired with 3-D data. For example, says Ng, grass has a distinctive texture that makes it look very different close up than it does from far away. The algorithm learns that the progressive change in texture gives clues to the distance of a patch of grass.

Larry Davis, a professor and chair of the computer-science department at the University of Maryland, in College Park, says that turning a single image into a 3-D model has been a hard and mathematically complicated problem in computer vision, and that even though Make3D gets things wrong, it often produces remarkable results.

Make3D is not the first site to extract a 3-D model from a single image. Researchers at Carnegie Mellon University (CMU) launched Fotowoosh in May 2007. (See "A New Dimension for Your Photos.") But Fotowoosh's algorithm is limited because it labels the orientation of surfaces as either horizontal or vertical, without taking into account such things as mountain slopes, rooftops, or even staircases, says Ng. Based on these restrictive assumptions, Fotowoosh infers the depth of a scene to reconstruct the image.

Derek Hoiem, who built Fotowoosh's algorithm, admits that unlike Make3D, his work does not give a good estimation of depth. "I can say this point [in an image] is a lot further [away from the foreground] than that point, but I can't say that this point is five meters away," he explains. Hoiem developed Fotowoosh with CMU faculty members Alexei Efros and Martial Hebert. (Hoiem is currently a postdoctoral fellow at the Beckman Institute at the University of Illinois at Urbana-Champaign.)

Saxena says that by knowing the depth values of objects in a scene, Make3D creates a higher-quality 3-D model. In a survey performed by the Stanford researchers in 2006, users preferred Make3D's images twice as often as images created using Fotowoosh.

Story continues below


Hoiem says that he has been trying to extend his work to deal with arbitrary angles, but he has yet to come up with a solution and is "impressed" with the Stanford researchers' work.

To build Make3D's algorithm, the Stanford researchers used a laser scanner to estimate the distance of every pixel or point in a two-dimensional image. That 3-D information was coupled with the image and reviewed by the algorithm so that it could learn to correlate visual properties in the image with depth values. For example, it will learn that a large blue pad is probably part of the sky and farther away, says Saxena. There are thousands of such visual properties that humans unconsciously use to determine depth. The Make3D algorithm learns these kinds of rules and processes images accordingly, he says.

Comments

  • 3D reconstruction
    Why not just snap a picture with a camera with 2 lenses set some distance apart. And reconstruct the 3D properties of the scene using parallax from 2 images of the same subject taken at the same time. Those kind of cameras should be easy to construct.
    Rate this comment: 12345

    SVE
    03/07/2008
    Posts:48
    Avg Rating:
    3/5
    • Re: 3D reconstruction
      I believe so you don't require a special camera for it to work, so you can take any older picture and make it '3d'

      The application of this is somewhat limited in it's current form, simply because it is unreliable if it is simply given random pictures.  on that note, with a few more advances and a combiniation with a few of the other imaging technologies I've seen on this site recently, it could produce some stunning real-time results.
      Rate this comment: 12345

      Shiladie
      03/10/2008
      Posts:56
      Avg Rating:
      4/5
  • a searchable 3D view of our world (inside & out)?
    Very soon, we should be able to reconstruct, view, and explore the entire world(inside & out(potentially)) in 3D. And the kicker is - you could view the same locations over time - or time lapse? My question is, can Google make it searchable (changes to the environment with dates/times/related history)? So you take the Make3D concept, and combine it with Photosynth and facial recognition.
    Link here: http://rantd.blogspot.com/2008/03/searchable-3d-view-of-our-world-inside.html
    Rate this comment: 12345

    donclark_atl...
    03/17/2008
    Posts:1

Log In

Forgot your password?     Register »
Advertisement

Videos

Malleable Maps, Artistic Robots and Bubble Interfaces
Technology Review January/February 2010

Current Issue

Security in the Ether
Information technology's next grand challenge will be to secure the cloud--and prove we can trust it.
Advertisement
Advertisement
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2010 Technology Review. All Rights Reserved.