Closing the divide: Students surf the Web at Ghana’s Kokrobitey Institute, a conference center with an Internet connection only about four times as fast as dial-up. The link is enhanced by Princeton’s low-cost, low-power HashCache technology, which stores frequently accessed Web content.
Olivier Asselin/WPN

10 Emerging Technologies

TR10: HashCache

Vivek Pai's new method for storing Web content could make Internet access more affordable around the world.

  • March/April 2009
  • By David Talbot

Throughout the developing world, scarce Internet access is a more conspicuous and stubborn aspect of the digital divide than a dearth of computers. "In most places, networking is more expensive--not only in relative terms but even in absolute terms--than it is in United States," says Vivek Pai, a computer scientist at Princeton University. Often, even universities in poor countries can afford only low-bandwidth connections; individual users receive the equivalent of a fraction of a dial-up connection. To boost the utility of these connections, Pai and his group created HashCache, a highly efficient method of caching--that is, storing frequently accessed Web content on a local hard drive instead of using precious bandwidth to retrieve the same information repeatedly.

Despite the Web's protean nature, a surprising amount of its content doesn't change often or by very much. But current caching technologies require not only large hard disks to hold data but also lots of random-access memory (RAM) to store an index that contains the "address" of each piece of content on the disk. RAM is expensive relative to hard-disk capacity, and it works only when supplied with electricity--which, like bandwidth, is often both expensive and scarce in the developing world.

HashCache abolishes the index, slashing RAM and electricity requirements by roughly a factor of 10. It starts by transforming the URL of each stored Web "object"--an image, graphic, or block of text on a Web page--into a shorter number, using a bit of math called a hash function. While most other caching systems do this, they also store each hash number in a RAM-hogging table that correlates it with a hard-disk memory address. Pai's technology can skip this step because it uses a novel hash function: the number that the function produces defines the spot on the disk where the corresponding Web object can be found. "By using the hash to directly compute the location, we can get rid of the index entirely," Pai says.

Advertisement

To be sure, some RAM is still needed, but only enough to run the hash function and to actually retrieve a specific Web object, Pai says. Though still at a very early stage of development, HashCache is being field-tested at the Kokrobitey Institute in Ghana and Obafemi Awolowo University in Nigeria.

Video

Print

Related Articles

Spare Some Bandwidth?

Pakistani scientists have a way to boost download speeds.

Net Neutrality: Far from a Done Deal

Legislation and industry concessions won't guarantee equal access to the Internet.

The Internet Reborn

A grass-roots group of leading computer scientists, backed by Intel and other heavyweight industrial sponsors, is working on replacing today's Internet with a faster, more secure, and vastly smarter network: PlanetLab.

Close Comments

To comment, please sign in or register

Forgot my password

millikenresearch

12 Comments

  • 1079 Days Ago
  • 02/24/2009

slinging hash

Go man go.

Reply

mmlug

1 Comment

  • 1058 Days Ago
  • 03/17/2009

good idea.

We really want to use in Myanmar.

Reply

bobmonsour

1 Comment

  • 1041 Days Ago
  • 04/03/2009

This just won an award at Princeton also

At last night's Innovation Forum at Princeton, in a form called EdgeXL, the HashCache technology was part of an award-winning presentation by Vivek Pai. For the university paper article on the competition, see http://tinyurl.com/cpuuww

Reply

Jonalist

6 Comments

  • 451 Days Ago
  • 11/14/2010

Re: This just won an award at Princeton also

I see that article on the competition. What I was wondering is if commercial uses are to become available how much is going to stop porn and the porn industry from claiming has as its copyright for its client if they have no such copyright hash cache themselves of that particular has number they claim. Can nations just keep sitting back thinking that copyright owners are not violating their own cache while they present to the world porn industries objective, who has access to their cache plays with fire on share programs so how does that hash number get itself recognized as being the one and only copyright of such a film if there's no server dedicated commercially selling it that a share program has access? Isn't what I am saying is their unconstitutional act of aggression against society which don't deal in has numbering systems? What came first the chicken or the egg, hashcache answers so many perplexing questions of whom has a right to know - ARE WE YET TO KNOW THE TRUTH? I could also fear the hashcache bomb, how about you, leave it up to the Egyptians their computing safest facility is well below a mile of the surface of the earth where might they place a hashcache and who can access it "Al-Qaeda".

Reply

Jonalist

6 Comments

  • 451 Days Ago
  • 11/14/2010

Has is Hash for HashCache

I got in a hurry to send my comment but you got the idea I was attempting to show how some HASH NUMBERING SYSTEM is not evident to all whom have caches of their own servers and of those businesses I chose to illustrate how a porn industry abuses a paid for Copyright by claiming to a Agency a Hash Number that they think is Copyrighted as their Product Name they sell on CD or DVD or on a VHS Tape. Is this their indecent truth unraveled revealing unconstitutional right acquisition worldwide or is it just in California where it is illegal to force someone to purchase porn if they don't even have it. I would refer to a 'HashCache Bomb' that causes the computer to crash having to reinstall all the files. Is this possible, Agents are making tons of money from sharers worldwide and most of those people they charge violated a Porn Clients Rights are going to jail for five years and when they return their computer will crash if it hasn't yet? Snitches that are lurking sharing programs seeking all the info on a sharer that just happens to have downloaded a particular film don't know that they are violating constitutional rights of individuals by hiding in behind a Agent that wants to know whom that individual is that has a Has Numbered File he seeks cause his Client paid him to sniff out anyone, and that Agent don't even own stock in the servers that sharers access nor the rights to claim anything about a particular share program. Big Beef taking place in California violates California Laws, I don't have respect for them even if they used clean unadultrated words in their Un-Published/Un-Copyright Supported Hash Numbered Films. What's That HashCache doing in a Legal Courtroom Summary if it was never Published by the so called Clients and Agents don't Publish that sort of stuff and expect that a court can grant them the Freedom we all enjoy because they are the Law Violators? Go ahead make a HashCache Numbering System complete so that schools can learn the Truth whenever the United States Justice System decides it is the appropriate time to tell everyone on God's Earth that its all been a lie so far and they are going to correct their lies, just DON'T VOTE FOR THEM ANYMORE OK! It is like me writing a program to safeguard Copyrights, everything that would become protected has to come first before introduction, anything that comes after the fact follows the same guidelines, the same issue came up during MP3 Legislation that a MP3 program does not recognize a particular Hash Password System in order to play that file which is required if online or offline and you paid for it regardless if it is not entered it cannot play - Caching Garbage Data. How do they cope with the lie, as long as you don't learn what is taking place they are happy people are going to prison instead of operating their home computer from a Public Facility or College - HashCache Those Prison Systems for Payroll for the mistake & HealthCare!

Reply

Advertisement

MAGAZINE

Can We Build Tomorrow's Breakthroughs?

Manufacturing in the United States is in trouble. That's bad news not just for the country's economy but for the future of innovation.

Videos

A Social-Media Decoder

More

Advertisement

Technology Review Lists

TR50

Our list of the 50 most innovative companies, including the following:

Facebook

Geron

Groupon

BIND Biosciences

More

Advertisement

Facebook

Advertisement