Technology Review - Published By MIT
Advertisement

TR Editors' blog

Insights, opinions, and our editors' analysis of the latest in emerging technologies.

Blog Topics

Recent Posts

Recent Comments

  • nancy16 : When doing research on cancer. Scientist should not indulge in whether the cancer was inherited...
  • david k : There is strong history of the street view as art.  Ed Ruscha took photos along the Sunset Strip...
  • chimenti : Under NADIN what procedure does a pilot follow for submitting a flight plan and how is the...
  • fiberman : How amusing. A contributor to the WSJ suggests eating your fellow man. Well, isn't that just what...
  • kstauff : I believe the deficit left by the Bush administration for fiscal '08 was around $500 billion. ...
  • kstauff : You're right, I overestimated the number of democrats in both houses, although I believe that the...
  • kstauff : Are you as angry at Roosevelt, Truman, Kennedy, Johnson and Clinton for the wars they prosecuted?...
  • kstauff : The Obama administration told us it would be 8% without the stimulus.  You tell me if he and his...
  • ... : Just to make it apparent, there's already a Chromium browser which uses the Chrome codebase for...
  • Adalast : people keep throwing around the "New Deal" and saying that it was horrible and didn't help our...
Advertisement
Thursday, February 12, 2009

Are We Safeguarding Social Data?

Technical problems at Ma.gnolia.com raise questions about how social media is being protected.
By Erica Naone

Recently, the social bookmarking service Ma.gnolia crashed in a big way, losing about half a terabyte of data in the process, including the backup of that data.

Ma.gnolia founder Larry Halff admits that the backup system just wasn't solid enough. Halff has also posted some tools and information on the main Ma.gnolia page that should help users recover some of their data. However, it sounds as though the future of the service may now be in question.

The episode raises some interesting questions. Sebastien Paquet, a professor of computer science at the Universitédu Québec à Montréal, posted nine ways to protect data stored in the cloud, inspired by his own loss of data stored using Ma.gnolia. He and others have taken the opportunity to reexamine their trust in Web 2.0 services.

I say that the problem goes much further. For example, how safe is the data behind something like Wikipedia? Ed Chi, a senior research scientist at the Palo Alto Research Center (PARC), and his colleagues have been studying Wikipedia since about 2006 (see my article on their tool, WikiDashboard). In the course of their work, they struggled to get access to live data from Wikipedia that could make their tool more up to date and useful.

Chi told me, "What was difficult about doing this is that the Wikipedia Foundation has a lot of work done by volunteers, and getting access to the live data required application to a special research server, which is only occasionally administered on a volunteer basis. It's amazing that it works at all, as the server really doesn't get much attention and needed maintenance. It is such a valuable resource, so it really should get more attention and caring and feeding."

Chi also suggested that Wikipedia's data is not getting backed up as often as it should be. Losing data from Ma.gnolia is a personal loss for many, but Wikipedia is one of the great achievements of the digital age, and losing even a day's worth of edits would be significant. We can't afford to lose sight of the basic upkeep that will allow us to hold on to what we create.

Losing a large volume of socially created data would be a cultural loss. Wikipedia, Flickr, Twitter, WordPress, etc. store troves of data that will be valuable to future generations. I hope the data is being treated accordingly.

Comments

  • Use the application office
    If web applications can be available offline, like what Google did with their gmail, information can optionally be stored on end-user machines for further restoration.

    MobileTechGuy
    Rate this comment: 12345

    MobileTechGu...
    02/12/2009
    Posts:3
    Avg Rating:
    2/5
  • Public vs. private data
    Thanks for the link!

    Wikipedia's liberal license and open content has led to its replication all over the place. There are sites that simply republish Wikipedia along with ads for profit. Though these copies are probably neither systematic or up-to-date, they would mitigate the impact of a loss.

    The more private the data, the less it will be replicated. A lot of personal email exists only in one place; a crash at Hotmail or Gmail would be nearly unrecoverable for many.
    Rate this comment: 12345

    sebpaquet
    02/13/2009
    Posts:1
Advertisement

Log In

Forgot your password?     Register »
Advertisement
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
•  Subscribe
Save 36%
•  Table of Contents
•  MIT News
» Gift Subscription
» Digital Subscription
» Reprints, Back Issues
» Subscribe
» Table of Contents
» MIT News

More Technology News from Forbes

Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.