Nick Reddyhoff

Notebooks

Updating Search

We need to reimagine the role of search engines and their sources of data.

  • May/June 2010
  • By Shashi Seth

Real-time search means retrieving information about what's happening, everywhere, now. The amount of real-time data that's available is growing rapidly with the proliferation of mobile devices. At Yahoo, we have already begun to incorporate real-time search results from Twitter and sources of developing news. But the scope of real-time data reaches far beyond tweets and Facebook updates. For example, users are uploading photos on Flickr to show what's happening around them, chatting about the latest news, and answering questions live on sites like Yahoo Answers. That's just the beginning of the real-time information that can be made available to search engines (see "TR10: Real-Time Search" ).

The sheer amount of real-time data presents unique challenges for search. Because a lot of the data is nonauthoritative, noisy, or spammy, search engines need to build trust models that can determine what data is important and influential. For example, retweets are not often useful results, and some data providers carry more authority than others. Search engines must also determine the right balance between timeliness and relevance to each user. Further, real-time data needs to be indexed and updated instantaneously. A few years ago, search engines took several hours to index. Today, they take only a few seconds--but they need to become even faster.

With the challenges of using real-time data come some exciting possibilities for reimagining search. As in the early days of the Web, when Yahoo built a directory to identify authoritative sites, we are seeing search engines building better trust models. Aggregators are emerging to qualify the reputations of sources. Many other types of self-organization are possible in this new realm.

We can imagine that to speed up the rate at which search engines are able to share real-time data, some sources will inform their indexes when something is happening. Rather than just waiting for search engines to crawl a site, users can push relevant new information. Say you are looking for a parking space in busy downtown San Francisco: parking lots might send updates to search engine indexes as spaces become available.

Advertisement

Already, we know that real-time search can serve needs other than those of traditional Web search. The resulting data can be invaluable in answering long-tail queries--those that aren't related to the most popular topics. The potential uses of real-time search are limitless. Finding--and inventing--new uses will make search even more valuable in our daily lives.

Shashi Seth is senior vice president of search products at Yahoo.

More in Notebooks

Biofuel Backlash

Read More »
Print

Related Articles

Future Gazing with Search Data

Search queries aren't always better than traditional trend-spotting methods.

Fewer Ads, More Clicks

Search engines get smarter at predicting when we'll respond to a pitch.

To comment, please sign in or register

Forgot my password

Advertisement

MAGAZINE

Can We Build Tomorrow's Breakthroughs?

Manufacturing in the United States is in trouble. That's bad news not just for the country's economy but for the future of innovation.

Videos

A Social-Media Decoder

More

Advertisement

Technology Review Lists

TR50

Our list of the 50 most innovative companies, including the following:

ARM Holdings

PrimeSense

BIND Biosciences

Facebook

More

Advertisement

Facebook

Advertisement