The Human Genome Project gave researchers an important initial roadmap to the human gene sequence, but it’s a map that might prove tough to navigate, given that the function and structure of most of the proteins that do the work for those genes remains a mystery.
That’s why the Human Proteome Folding Project – a recent collaboration between IBM, United Devices, the Institute for Systems Biology and the University of Washington – is picking up where the Human Genome Project left off. Understanding the form and function of these proteins, which are at the core of many diseases and the natural target for many treatment drugs, will ultimately put researchers that much closer to understanding why certain diseases happen and how to treat and cure them.
“The Human Genome Project is the foundation on which this project sits,” say Dr. Rich Bonneau, senior scientist for the Institute for Systems Biology, the Seattle, Washington non-profit research institute that is spearheading the biology research effort for the Human Proteome Folding Project.
But running the computations necessary to create such a catalog could take literally a million years on a state-of-the-art PC – 50 years if you used a substantially more powerful commercial 1,000-node cluster computer. But by ‘borrowing’ unused computing cycles from volunteers who download a program to their PCs – a la SETI@Home – researchers believe they can get at least a rough sketch of more than 100,000 proteins before the end of this year.
Tens of thousands of people have already downloaded the program through IBM’s and United Devices’ grids, both of which are being used for this program, and that number is hoped to quickly reach into the millions.
The Beginning of a Beautiful Friendship
The Institute for Systems Biology (ISB) was approached about 18 months ago by United Devices, an Austin technology firm that makes software for grid computing. The software company had a vested interest in life science-related projects as it counts five of the world’s six biggest pharmaceutical companies as customers, and has been running its own distributed community grid since April 2001.
As with the well-known SETI@Home project, computer users can go online to United Devices’ site at www.grid.org, and download a client that runs computations on their computer’s unused cycles. Using the huge computational power of its own grid, United Devices has run previous projects looking into anti-viral leads for smallpox, and screening molecules to find treatments for anthrax.
When United Devices approached the Institute for Systems Biology, says Ed Hubbard, the software company’s president and founder, he was looking for “problems that they might solve” by harnessing the grid’s computing strength.
And, as it turned out, mapping human protein structures was a good fit.
Researchers at the Institute were already using Rosetta, a software package developed at the University of Washington to predict the structure of proteins. But while Rosetta offers a good forecast for what these proteins might look like, its predictions are not infallible. Researchers still need to run countless computations to determine the accuracy of a protein’s three-dimensional structure.
That’s where the grid comes in handily.