Huge quantities of information are never more than a few clicks away on the Web, but it’s not always easy to see what things were like yesterday. News stories and blog posts might be archived, but other information often gets lost. For instance, while it’s trivial to find a book’s sales ranking on Amazon today, it’s less simple to see what it was last week. And for anyone curious about how news evolves, it might not be obvious how a story’s prominence has changed–did it get top billing on news sites the day it broke, or was it buried at the bottom of the page? A new tool called Zoetrope is designed to help track such information by letting users browse backward through time.
Other projects, such as the Internet Archive, already preserve historical versions of websites. But Mira Dontcheva, a research scientist in the Advanced Technologies Lab at Adobe Systems, where Zoetrope was developed, says the new tool makes it much easier to browse through this kind of data. “Having access to temporal information can help us come up with more compelling stories of what’s going on around us,” she says.
A user can peer back in time through Zoetrope in several ways. Simply pulling a scrollbar at the bottom of the browser winds a Web page back to show what it looked like hours, days, or months ago. Or, if the user is interested in one specific piece of information, like the price of a certain product, he or she can draw a “lens” over that area of the page to see how it changes.
An experienced user can perform even more-advanced analysis. For example, configured correctly, Zoetrope will recognize a price as it goes up or down and will show the results as a graph. It’s also possible to draw lenses on different websites and sync them in order to carry out a historical comparison. For example, a user could use one lens to track weather information and another lens to track movie-attendance figures. Looking at how both lenses change over time might reveal a correlation between bad weather and high movie turnout. Zoetrope can also track some pieces of data as they move about a page over time.
The system is limited, however, by how much historical data is available. To test the tool, the researchers chose 1,000 frequently updated websites and stored information captured every hour over four months.
But for Zoetrope to cover the entire Web would mean capturing huge amounts of data, says Eytan Adar, a PhD student at the University of Washington who was involved with the research. He has investigated the rates at which people tend to check different pages for updates and says that such information could provide insights into how often pages need to be recorded, thereby reducing the amount of data that needs to be stored. “It’s impossible to crawl and capture some of these things at the rate at which they’re changing,” Adar says. “But for something like Zoetrope, it’s a smaller percentage of the Web that we want to track. We don’t actually need to get every single page that’s out there.”
Kris Carpenter, who directs efforts to record Web pages at the Internet Archive, is enthusiastic about the new tool. “This is a fantastic leap forward,” she says, adding that Zoetrope could be used as a stand-alone application or eventually become part of the browser. “The advances of the interface are phenomenal in terms of being able to navigate data in a very different way and associate it across websites,” Carpenter says. “I think most users have an interest in trying to connect the dots between different sources of information, but there are almost no tools available to make that an easy thing to do.” She adds that the Internet Archive is interested in sharing its data with the Zoetrope researchers.
Adobe’s Dontcheva says that before Zoetrope can be released, her team intends to develop additional features to make it simpler and more powerful to use. “The interface is still evolving,” she says.