Technology Review - Published By MIT
Advertisement

Surprising Search Patterns

Continued from page 1

By Kate Greene

Friday, August 18, 2006

smaller text tool iconmedium text tool iconlarger text tool icon

The explanation appears to be fairly simple: more and more people are searching for more specific information. If someone submits a general query, say, "bird flu," the results at the top of a search-engine's results page will indeed list high-traffic websites, for example, the Centers for Disease Control site. And that site's popularity will be reinforced. But Web searches are becoming increasingly more complex, according to Menczer. A search for "bird flu Turkey 2005" will bring up far fewer results, and lead to more obscure pages. "If you consider that people submit diverse queries that return a small number of hits," he says, "that means traffic is distributed to less-popular sites."

The results are somewhat controversial because many people have been operating under the assumption that a Googlearchy does exist, says Albert-László Barabási, professor of physics at the University of Notre Dame and also an expert on Internet behavior and how websites are connected to each other. He agrees with Menczer that general searches do make some types of sites more popular. "I think the message here is that as soon as you become a slightly more sophisticated searcher, then you're breaking the spell of the Web," he says.

The theory that people are becoming more adept in searching the Web is borne out by some hard data, too. According to Hitwise, a firm that tries to improve companies' search rankings, people are increasingly using more words per search query. Based on this trend, Menczer's research seems reasonable, says Bill Tancer, general manager of global research at Hitwise.

But Tancer also questions the quality of data used to test the researchers' models. For example, the traffic data for the research was gleaned from a free, downloadable search tool, Alexa, which provides Web statistics. But, according to Tancer, this data could be biased because Alexa users tend to be online marketers rather than average Web users.

In addition, the study used data from 2003, and "a lot has changed since then," says Tancer. Hitwise data, which is collected directly from Internet service providers such as AT&T, suggests that people interact with the Web in a number of ways, not just by either using searching engines or surfing. Tancer says people also end up on sites from directly typing in a URL, through sponsored links, where companies pay money to appear prominently on a search page, and through social networking sites.

Indiana's Menczer says that the paper, released last week in the Proceedings of the National Academy of Sciences, is a first attempt to show how Web data may or may not corroborate the idea of a Googlearchy. Currently, his group is exploring the effects of other modes of Web use, including social search, to see if sites such as digg.com and del.icio.us amplify or diminish his team's results.

Meanwhile, the Indiana researchers' work provides an important analysis of a commonly held assumption about search engines, says Matt Hindman, professor of political science at Arizona State University in Phoenix. Using "empirical data to model these relationships rather than just assume" is what had been missing, he says.

Comments

  • Is this really a surprise?
    If I'm looking for something very basic - exchange rates, or a newly released book - I'll go with the first couple of hits off a search engine.  If I'm searching for something very specific I don't want to see that I've got 400,000 pages that match my query.  I'm going to keep on adding terms until I'm down to a few thousand pages.  I would say that what you have in your first few lines on a page (or your data lines if those show up) have far more impact on whether or not I click through to your site then where you appear on the Google listing.  It's common for me, and most of my friends, to click through 10-12 pages of links before hitting on the few that I think will work for me.  I'm in fact far _less_ likely to click on the first one since overall those tend to be commercial sites.
    Rate this comment: 12345

    deirdrebeth
    08/18/2006
    Posts:25
    Avg Rating:
    3/5
    • Re: Is this really a surprise?
      If pages are ranked purely by PageRank (or some popularity measure), it is easy to see that Googlearchy will exist. However, search engines first pick up pages that match the queries and then rank the filtered pages by PageRank (plus many other parameters), then pages at the top are not necessarily popular globally, but they tend to be more popular than other pages within the filtered group. Then, googlearchy exists only for individual queries, which obviously is less of a problem.
      By the way, PageRank is based on the popularity of pages not websites. Also, authority is a better term than popularity, which seems to imply that authors link to pages without much thinking or editorial judgment.
      Rate this comment: 12345

      diklee
      08/18/2006
      Posts:1

Log In

Forgot your password?     Register »
Advertisement

Videos

Making 3D Maps on the Move
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
Featured Content
Sponsored by:
White Papers

Twelve ways to reduce costs with SQL Server 2008
Find out how to reduce costs and get more efficient

Download

Total Economic Impact of SQL Server 2008 Upgrade
Forrester reports on increasing productivity and management capabilities

Download 

Achieving Cost and Resource Savings with UC
How Office Communications Server R2 and Exchange Server can make your business smarter and more efficient

Download 

The Compelling Case for Conferencing
Read how you can improve workload support and find IT efficiencies

Download

How Windows Server 2008 R2 Helps Optimize IT and Save you Money
Read how you can improve workload support and find IT efficiencies

Download

Windows Server 2008 R2 Hyper-V Live Migration
See how Windows Server 2008 R2 and Hyper-V enable virtualization and Live Migration

Download
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.