Technology Review - Published By MIT
Advertisement

3-D Modeling Advance

Continued from page 1

By Brittany Sauser

Friday, March 07, 2008

smaller text tool iconmedium text tool iconlarger text tool icon

To process an image, the algorithm divides the still image into tiny pieces or segments, says Ng. "It tries to take each of these small pieces and simultaneously figure out their 3-D position, angle, and orientation in the image."

When a new image is uploaded on the site, it only takes a couple of minutes for the algorithm to reconstruct it to a 3-D model and make a movie of the scene. However, the website is not yet optimal, so it takes about an hour for the user to receive an e-mail message indicating that her visualizations are ready. A user can store images and movies in a personal gallery on the site. The researchers are working to connect their site to photo-sharing sites like Photobucket and Flickr, says Saxena.

Make3D can also take two or three images of the same location to create a 3-D model similar to Microsoft's Photosynth application. (See "Microsoft's Shiny New Toy.") But Photosynth is a more expansive project that uses hundreds of images to reconstruct a scene, and when there are that many images to work with, computing the depth of scenes is not as mathematically complicated and is more accurate, says Hoiem. Make3D's focus is on processing single images for the general consumer, who might only take one image of a scene, says Ng.

Alex Daley, the group product manager for Microsoft Live Labs, says that there is a complementary relationship between single-image processors and multiple-image processors: improving single-image processing will ultimately make it easier for other systems to match multiple photos together. "Mixing and matching these for the right set of images will provide the best set of results," Daley adds. (He says that Microsoft is open to working with applications such as Make3D, but the company has not yet spoken with the Stanford researchers.)

Make3D's current algorithm only works on outdoor scenes or landscapes and a few kinds of indoor scenes, such as those that focus on staircases, and it's meant to help users share experiences or relive their own. The researchers are working to extend the algorithm to a broader range of settings so that it can recognize things like humans and coffee mugs and be used to create real-life environments for gaming and virtual worlds. Saxena is also working to incorporate the technology into robots to improve navigation and assist them at carrying out such tasks as unloading a dishwasher.

CMU's Efros says that the work provides a new perspective on the computer-vision problem and will hopefully result in a deeper understanding of how human vision functions.

Comments

  • 3D reconstruction
    Why not just snap a picture with a camera with 2 lenses set some distance apart. And reconstruct the 3D properties of the scene using parallax from 2 images of the same subject taken at the same time. Those kind of cameras should be easy to construct.
    Rate this comment: 12345

    SVE
    03/07/2008
    Posts:48
    Avg Rating:
    3/5
    • Re: 3D reconstruction
      I believe so you don't require a special camera for it to work, so you can take any older picture and make it '3d'

      The application of this is somewhat limited in it's current form, simply because it is unreliable if it is simply given random pictures.  on that note, with a few more advances and a combiniation with a few of the other imaging technologies I've seen on this site recently, it could produce some stunning real-time results.
      Rate this comment: 12345

      Shiladie
      03/10/2008
      Posts:55
      Avg Rating:
      4/5
  • a searchable 3D view of our world (inside & out)?
    Very soon, we should be able to reconstruct, view, and explore the entire world(inside & out(potentially)) in 3D. And the kicker is - you could view the same locations over time - or time lapse? My question is, can Google make it searchable (changes to the environment with dates/times/related history)? So you take the Make3D concept, and combine it with Photosynth and facial recognition.
    Link here: http://rantd.blogspot.com/2008/03/searchable-3d-view-of-our-world-inside.html
    Rate this comment: 12345

    donclark_atl...
    03/17/2008
    Posts:1

Log In

Forgot your password?     Register »
Advertisement

Videos

Making 3D Maps on the Move
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
Featured Content
Sponsored by:
White Papers

Twelve ways to reduce costs with SQL Server 2008
Find out how to reduce costs and get more efficient

Download

Total Economic Impact of SQL Server 2008 Upgrade
Forrester reports on increasing productivity and management capabilities

Download 

Achieving Cost and Resource Savings with UC
How Office Communications Server R2 and Exchange Server can make your business smarter and more efficient

Download 

The Compelling Case for Conferencing
Read how you can improve workload support and find IT efficiencies

Download

How Windows Server 2008 R2 Helps Optimize IT and Save you Money
Read how you can improve workload support and find IT efficiencies

Download

Windows Server 2008 R2 Hyper-V Live Migration
See how Windows Server 2008 R2 and Hyper-V enable virtualization and Live Migration

Download
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.