Hello,

We noticed you're browsing in private or incognito mode.

To continue reading this article, please exit incognito mode or log in.

Not an Insider? Subscribe now for unlimited access to online articles.

Internet Artifacts

Web

The Internet is, by its very nature, a transitory medium-pages come and go. But if you had a publicly available Web page in the past three years, chances are that a copy of it is in the collection of the Internet Archive, a nonprofit group that saves “snapshots” of the Internet.

The Archive was founded by Brewster Kahle, whose San Francisco-based Web browser company, Alexa Internet, collects the snapshots every two months and donates the digital tapes to the Archive. As of May, the Archive was in excess of 13 terabytes (a terabyte is 1 million megabytes); in comparison, the Library of Congress holds the equivalent of about 20 terabytes. The Archive is stored in two separate machines in different locations. “It’s too important to have in one place. An earthquake could cause destruction of a collection that’s as large as the largest library ever built by humans,” says Kahle.

But it is proving easier to save the information than to sort through it for any useful purpose. While recent data are stored on disk for quick retrieval, the bulk of the archive is in a library of digital tapes that are too slow to search effectively. Currently, the only way the public can get at it is through the Alexa toolbar (downloadable at www.alexa.com), but, at the time TR went to press, only about the last six months of snapshots were available. When the reading room for these massive stacks is finally built, however, the Archive will be quite a collection.

Become an MIT Technology Review Insider for in-depth analysis and unparalleled perspective.

Subscribe today
Want more award-winning journalism? Subscribe and become an Insider.
  • Insider Plus {! insider.prices.plus !}* Best Value

    {! insider.display.menuOptionsLabel !}

    Everything included in Insider Basic, plus the digital magazine, extensive archive, ad-free web experience, and discounts to partner offerings and MIT Technology Review events.

    See details+

    Print + Digital Magazine (6 bi-monthly issues)

    Unlimited online access including all articles, multimedia, and more

    The Download newsletter with top tech stories delivered daily to your inbox

    Technology Review PDF magazine archive, including articles, images, and covers dating back to 1899

    10% Discount to MIT Technology Review events and MIT Press

    Ad-free website experience

  • Insider Basic {! insider.prices.basic !}*

    {! insider.display.menuOptionsLabel !}

    Six issues of our award winning print magazine, unlimited online access plus The Download with the top tech stories delivered daily to your inbox.

    See details+

    Print Magazine (6 bi-monthly issues)

    Unlimited online access including all articles, multimedia, and more

    The Download newsletter with top tech stories delivered daily to your inbox

  • Insider Online Only {! insider.prices.online !}*

    {! insider.display.menuOptionsLabel !}

    Unlimited online access including articles and video, plus The Download with the top tech stories delivered daily to your inbox.

    See details+

    Unlimited online access including all articles, multimedia, and more

    The Download newsletter with top tech stories delivered daily to your inbox

/3
You've read of three free articles this month. for unlimited online access. You've read of three free articles this month. for unlimited online access. This is your last free article this month. for unlimited online access. You've read all your free articles this month. for unlimited online access. You've read of three free articles this month. for more, or for unlimited online access. for two more free articles, or for unlimited online access.