We noticed you're browsing in private or incognito mode.

To continue reading this article, please exit incognito mode or log in.

Not an Insider? Subscribe now for unlimited access to online articles.

A Smarter Search for What Ails You

Software searches through medical information by analyzing the structure of sentences in a new way.

Finding information on the Web isn’t hard, as long as you know what you’re looking for. Sometimes, though, the most useful information can remain hidden within the body of a complex document, and only the most carefully chosen combination of keywords will uncover it.

Semantic search technologies promise to help change this by returning more relevant information based on an understanding of the relationships between different words. Last week, Netbase Solutions, a company based in Mountain View, CA, released search software called Content Intelligence that organizes searchable content by analyzing sentence structure in a novel way. The company created a demonstration of the platform that searches through health-related information.

When a user enters the name of a disease, he or she is most interested in common causes, symptoms, and treatments, and in finding doctors who specialize in treating it, says Netbase CEO and cofounder Jonathan Spier. So the company’s new software doesn’t simply return a list of documents that reference the disease, as most search engines would. Instead, it presents the user with answers to common questions. For example, it shows a list of treatments and excerpts from documents that discuss those treatments. The Content Intelligence platform is not intended as a stand-alone search engine, Spier explains. Instead, Netbase hopes to sell it to companies that want to enhance the quality of their results.

The software uses an approach called natural-language processing to unravel the structure of sentences. Other semantic search technologies use the same approach but usually focus on keywords and how those words are related. For example, a tool intended for searching medical databases might be built to include information about common names for drugs and how those drugs are related to each other. As a result, those tools only work in specific subject areas and have to be adjusted whenever they are applied to a new topic, says Jens Tellefson, vice president of marketing and product strategy for Netbase.

In contrast, Netbase’s software focuses on recognizing phrases that describe the connections between important words. For example, when the system looks for treatments, it might search for phrases such as “reduce the risk of” instead of the name of a particular drug. Tellefson notes that this isn’t a matter of simply listing instances of this phrase, rather catching phrases with an equivalent meaning. Netbase’s system uses these phrases to understand the relationship between parts of the sentence. Tellefson says that this means the system can be used in different subject areas and does not need constant retraining.

Sentence analysis: Netbase’s software can identify treatments used for certain diseases and directs a user to relevant excerpts from documents.

Spier adds that Netbase has worked for five years to develop an algorithm that can search through millions of documents looking for the sorts of phrases that suggest that a sentence might be interesting and relevant.

The Content Intelligence platform is the technology behind a search system called illumin8, offered by the publishing company Elsevier. That product uses Netbase’s software to search through Web content and technical papers published by Elsevier. In response to a user’s query, illumin8 lists the key researchers involved with a topic, companies that have created related products, and potential customers for applications of the technology.

Colby Ellis, CEO of Elsevier’s engineering and technology group, says that the company hired Netbase because it wanted to build a tool that could make its content more accessible. Basic searches pull up lists of papers, which can be hard for a user to sort through. “Adding a technology that helps people discover what’s actually in the content without having to read it all is an extremely valuable time-saver,” Ellis says.

Susan Feldman, an IDC analyst who studies search technologies, says that Netbase’s ability to deal with massive amounts of information on the Web could prove valuable. “Natural-language processing has always been constrained by having to do so much processing ahead of time in order to deal with a lot of information,” Feldman says.

Spier says that this year, Netbase will focus on selling its platform to companies that offer health-care-related searches, but he stresses that it could easily be expanded to other subject areas.

Become an MIT Technology Review Insider for in-depth analysis and unparalleled perspective.

Subscribe today

Uh oh–you've read all of your free articles for this month.

Insider Premium
$179.95/yr US PRICE

Want more award-winning journalism? Subscribe to Insider Plus.
  • Insider Plus {! insider.prices.plus !}*

    {! insider.display.menuOptionsLabel !}

    Everything included in Insider Basic, plus the digital magazine, extensive archive, ad-free web experience, and discounts to partner offerings and MIT Technology Review events.

    See details+

    What's Included

    Unlimited 24/7 access to MIT Technology Review’s website

    The Download: our daily newsletter of what's important in technology and innovation

    Bimonthly print magazine (6 issues per year)

    Bimonthly digital/PDF edition

    Access to the magazine PDF archive—thousands of articles going back to 1899 at your fingertips

    Special interest publications

    Discount to MIT Technology Review events

    Special discounts to select partner offerings

    Ad-free web experience

You've read all of your free articles this month. This is your last free article this month. You've read of free articles this month. or  for unlimited online access.