Skip to Content

Usually, studying the relationship between DNA and disease involves comparing the genomes of thousands of people with a disorder to the genomes of thousands of people who don’t. These studies can be expensive and may take years, requiring researchers to identify patients, enroll them in the study, and collect the genomic data.

A more cost-effective and speedier alternative is to mine the growing pool of genetic data in electronic medical records, report researchers in Nature Biotechnology on Sunday. These records chronicle a patient’s health care history, which can include physician’s notes, lab test results, and the billing codes hospitals submit to health insurance companies to receive payments.

The idea behind the new method for genetic discover is to be able to “reuse” the data in these records for medical discoveries, says Joshua Denny, a physician-scientist at Vanderbilt University School of Medicine.

To identify previously unknown relationships between disease and DNA variants, Denny and colleagues grouped around 15,000 billing codes from medical records into 1,600 disease categories. Then, the researchers looked for associations between disease categories and DNA data available in each record.

Their biggest new findings all involved skin diseases (just a coincidence, says Josh Denny, the lead author): non melanoma skin cancer and two forms of skin growths called keratosis, one of which is pre-cancerous. The team was able to validate the connection between these conditions and their associated gene variants in other patient data.

Unlike the standard method of exploring the genetic basis of disease, electronic medical records (EMRs) allows researchers to look for genetic associations of many different diseases at once, which could lead to a better understanding of how some single genes may affect multiple characteristics or conditions. The approach may also be less biased than disease-specific studies.

The study examined 13,000 EMRs, but in the future, similar studies could look benefit from much larger data sets. While not all patient records contain the genetic data needed to drive this kind of research, that is expected to change now that DNA analysis has become faster and more affordable in recent years and more and more companies and hospitals offer genetic analysis as part of medical care. When researchers have millions of EMRs at their finger tips, more subtle and complex effects of genes on disease and health could come to light. For example, it could allow for important studies on the genetics of drug side effects, which can be rare, affecting maybe 1 in 10,000 patients, Denny says.

Keep Reading

Most Popular

Large language models can do jaw-dropping things. But nobody knows exactly why.

And that's a problem. Figuring it out is one of the biggest scientific puzzles of our time and a crucial step towards controlling more powerful future models.

OpenAI teases an amazing new generative video model called Sora

The firm is sharing Sora with a small group of safety testers but the rest of us will have to wait to learn more.

Google’s Gemini is now in everything. Here’s how you can try it out.

Gmail, Docs, and more will now come with Gemini baked in. But Europeans will have to wait before they can download the app.

This baby with a head camera helped teach an AI how kids learn language

A neural network trained on the experiences of a single young child managed to learn one of the core components of language: how to match words to the objects they represent.

Stay connected

Illustration by Rose Wong

Get the latest updates from
MIT Technology Review

Discover special offers, top stories, upcoming events, and more.

Thank you for submitting your email!

Explore more newsletters

It looks like something went wrong.

We’re having trouble saving your preferences. Try refreshing this page and updating them one more time. If you continue to get this message, reach out to us at customer-service@technologyreview.com with a list of newsletters you’d like to receive.