Skip to Content

The Holes in Our Genomes

Scanning DNA for structural changes brings new insight into disease.
September 19, 2008

Over the past two years, scientists have made a surprising discovery about our DNA. Like a book with torn pages, duplicate chapters, or upside-down paragraphs, everyone’s genome is riddled with large mistakes. These “copy number variations” can include deletions, duplications, and rearrangements of stretches of DNA ranging in size from one thousand to one million base pairs. New tools to screen for such mistakes, described this month in Nature Genetics, should generate a more complete picture of the genetic root of common diseases.

Hole-y DNA: Gene microarrays (shown above) designed to measure two different kinds of genetic variation–single-base changes and larger structural variations–are shedding new light on the genetic basis of complex disease.

“There has been a shock at the prevalence of this kind of variation and a desire to characterize it more fully and to integrate it into genome-wide studies of disease,” says Matthew Hurles, a geneticist at the Wellcome Trust Sanger Institute, in Cambridge, U.K., who was not involved in either study. “Now we have the tools that will enable those discoveries.”

Over the past few years, advances in gene microarray technologies, which can quickly survey large volumes of DNA, have allowed scientists to screen more human genomes than ever before, resulting in a flood of information linking specific genes to disease. Most of these studies begin by looking for single-letter changes in the DNA code, called single-nucleotide polymorphisms, or SNPs (pronounced “snips”). SNPs found more often in people with a particular disease point researchers to genetic variations that might play a role in that disease. Scientists have so far identified 200 disease-linked genes using this approach, but even large studies of thousands of patients have uncovered genetic variations that account for only a small proportion of complex diseases. In type 2 diabetes, for example, the 18 disease-linked genes that have been identified explain perhaps 5 percent of the disease’s heritability.

Scientists have now adapted these microarrays to identify both small SNP changes and copy number variations, which they hope will help them identify a larger fraction of disease-causing genes. In one of the papers in Nature Genetics, David Altshuler, a physician and scientist at the Broad Institute, in Cambridge, MA, and his collaborators described the design of such a chip, in collaboration with genomics instrument maker Affymetrix, which they then used to map this kind of variation.

Altshuler’s team assayed the DNA of 270 people whose genomes were already being studied as part of the HapMap project, which is cataloguing common genetic variants. They found that most copy number variations are inherited, as SNPs are, rather than arising anew in individuals. That news is likely to be a relief to geneticists, because it means that they can survey many structural changes by employing the same high-throughput approach used to catalogue SNPs.

The Broad group and others are now using microarrays that look for copy number variations to study a variety of common diseases. “In the next couple of years, we should really start to see new insights into the disease-causing mechanisms that result from this kind of mechanism,” says Hurles. In fact, two such studies have already yielded important insights, identifying rare deletions linked to autism and schizophrenia. “If we can interrogate both kinds of variation in the same patient in the same experiment, we can get an integrated understanding of how variations come together to influence disease,” says Steven McCarroll, a geneticist at the Broad and lead author of the paper.

Still, some types of copy number variations may be going undetected. In a second paper published in Nature Genetics, Greg Cooper and his colleagues at the University of Washington compared data collected using microarrays sold by Illumina with a sequencing-based assay published last year. They found that the array missed a number of changes centered on so-called hot spots, where multiple duplications–a string of four or five copies of a gene–make DNA difficult to study. Cooper says that different approaches will likely be needed to study these changes.

“The more detail we can get about our genomes–each peeling of the layer of the onion–teaches us more about disease,” says Altshuler. “The technology is moving in parallel, so as we move further, we can investigate each layer in detail.”

Keep Reading

Most Popular

Geoffrey Hinton tells us why he’s now scared of the tech he helped build

“I have suddenly switched my views on whether these things are going to be more intelligent than us.”

ChatGPT is going to change education, not destroy it

The narrative around cheating students doesn’t tell the whole story. Meet the teachers who think generative AI could actually make learning better.

Meet the people who use Notion to plan their whole lives

The workplace tool’s appeal extends far beyond organizing work projects. Many users find it’s just as useful for managing their free time.

Learning to code isn’t enough

Historically, learn-to-code efforts have provided opportunities for the few, but new efforts are aiming to be inclusive.

Stay connected

Illustration by Rose Wong

Get the latest updates from
MIT Technology Review

Discover special offers, top stories, upcoming events, and more.

Thank you for submitting your email!

Explore more newsletters

It looks like something went wrong.

We’re having trouble saving your preferences. Try refreshing this page and updating them one more time. If you continue to get this message, reach out to us at with a list of newsletters you’d like to receive.