Computing

3-D Modeling Advance

(Page 2 of 2)

  • Friday, March 7, 2008
  • By Brittany Sauser

To process an image, the algorithm divides the still image into tiny pieces or segments, says Ng. "It tries to take each of these small pieces and simultaneously figure out their 3-D position, angle, and orientation in the image."

When a new image is uploaded on the site, it only takes a couple of minutes for the algorithm to reconstruct it to a 3-D model and make a movie of the scene. However, the website is not yet optimal, so it takes about an hour for the user to receive an e-mail message indicating that her visualizations are ready. A user can store images and movies in a personal gallery on the site. The researchers are working to connect their site to photo-sharing sites like Photobucket and Flickr, says Saxena.

Make3D can also take two or three images of the same location to create a 3-D model similar to Microsoft's Photosynth application. (See "Microsoft's Shiny New Toy.") But Photosynth is a more expansive project that uses hundreds of images to reconstruct a scene, and when there are that many images to work with, computing the depth of scenes is not as mathematically complicated and is more accurate, says Hoiem. Make3D's focus is on processing single images for the general consumer, who might only take one image of a scene, says Ng.

Alex Daley, the group product manager for Microsoft Live Labs, says that there is a complementary relationship between single-image processors and multiple-image processors: improving single-image processing will ultimately make it easier for other systems to match multiple photos together. "Mixing and matching these for the right set of images will provide the best set of results," Daley adds. (He says that Microsoft is open to working with applications such as Make3D, but the company has not yet spoken with the Stanford researchers.)

Make3D's current algorithm only works on outdoor scenes or landscapes and a few kinds of indoor scenes, such as those that focus on staircases, and it's meant to help users share experiences or relive their own. The researchers are working to extend the algorithm to a broader range of settings so that it can recognize things like humans and coffee mugs and be used to create real-life environments for gaming and virtual worlds. Saxena is also working to incorporate the technology into robots to improve navigation and assist them at carrying out such tasks as unloading a dishwasher.

CMU's Efros says that the work provides a new perspective on the computer-vision problem and will hopefully result in a deeper understanding of how human vision functions.

Print

Related Articles

A Face-Finding Search Engine

A new approach to face recognition is better at handling low-resolution video.

Microsoft's Shiny New Toy

Photosynth is still a work in progress. It is dazzling, but what is it for?

A New Dimension for Your Photos

Web service Fotowoosh wants to be the Flickr of 3-D.

Close Comments

To comment, please sign in or register

Forgot my password

SVE

51 Comments

  • 1439 Days Ago
  • 03/07/2008

3D reconstruction

Why not just snap a picture with a camera with 2 lenses set some distance apart. And reconstruct the 3D properties of the scene using parallax from 2 images of the same subject taken at the same time. Those kind of cameras should be easy to construct.

Reply

Shiladie

56 Comments

  • 1436 Days Ago
  • 03/10/2008

Re: 3D reconstruction

I believe so you don't require a special camera for it to work, so you can take any older picture and make it '3d'

The application of this is somewhat limited in it's current form, simply because it is unreliable if it is simply given random pictures.  on that note, with a few more advances and a combiniation with a few of the other imaging technologies I've seen on this site recently, it could produce some stunning real-time results.

Reply

donclark_atlanta

1 Comment

  • 1429 Days Ago
  • 03/17/2008

a searchable 3D view of our world (inside & out)?

Very soon, we should be able to reconstruct, view, and explore the entire world(inside & out(potentially)) in 3D. And the kicker is - you could view the same locations over time - or time lapse? My question is, can Google make it searchable (changes to the environment with dates/times/related history)? So you take the Make3D concept, and combine it with Photosynth and facial recognition.
Link here: http://rantd.blogspot.com/2008/03/searchable-3d-view-of-our-world-inside.html

Reply

Advertisement

MAGAZINE

Can We Build Tomorrow's Breakthroughs?

Manufacturing in the United States is in trouble. That's bad news not just for the country's economy but for the future of innovation.

Sponsored Content

Technologies from National Instruments

Adding Data Logging
Log measured data to a file and open it in Microsoft Excel

> Click here for more National Instruments Videos <
Whitepaper

Temperature Measurements with Thermocouples: How-To Guide

This document is part of the “How-To Guide for Most Common Measurements” centralized resource portal. This tutorial provides a detailed guide for measurement and device considerations to take temperature measurements using thermocouples. Get an introduction to thermocouples, which are inexpensive sensing devices widely used with PC-based data acquisition systems. Also review some specific thermocouple examples and learn how thermocouples work and ways to integrate them into a data acquisition measurement system.

View full PDF > Listen to story >
Find us on Youtube

Videos

A Robot Recruit that Can Do It All

More

Advertisement

Technology Review Lists

TR50

Our list of the 50 most innovative companies, including the following:

Apple

1366 Technologies

ARM Holdings

Ushahidi

More

Advertisement

Facebook

Advertisement