Technology Review - Published By MIT
Advertisement

A Better Way to Rank Expertise Online

New software distinguishes between experts and spammers, showing who can be trusted.

By Brittany Sauser

Friday, July 31, 2009

smaller text tool iconmedium text tool iconlarger text tool icon

Websites where users can organize and share information are flourishing, but it can be hard to know which users and information to trust. Now a team of European researchers has developed an algorithm that ranks the expertise of users and can spot those who are using a site only to spam.

Credit: Technology Review

The technique works in a way similar to Amazon's reputation engine or the ratings of Wikipedia pages, but it evaluates users based on a new set of criteria that makes intuitive assumptions about experts.

The algorithm draws on a method applied in ranking Web pages, but takes it an interesting step further, says Jon Kleinberg, a professor of computer science at Cornell University in Ithaca, NY, who was not involved with the work. "It distinguishes between 'discoverers' and 'followers,'" Kleinberg says, "focusing on users who are the first to tag something that subsequently becomes popular."

The new work focuses on collaborative tagging systems such as Delicious, a social bookmarking website, and Flickr, a photo-sharing site. These sites let users add relevant keywords to "tag" Web links or photos and then share them. Normally, users are ranked by how frequently or how recently they add content to the system. "It's quantity over quality, so the more you do, the more credit you get," says Michael Noll, a researcher in computer science at Hasso Plattner Institute in Potsdam, Germany, and a researcher on the new software. "But the fact is [that] quantity does not imply quality."

Story continues below


The conventional approach also leaves the system very vulnerable to Web spammers, says Ciro Cattuto, a researcher at the Complex Network and Systems Group of the Institute for Scientific Interchange Foundation in Italy. Spammers adapt to the social behavior of other users, Cattuto says, so they see the most popular tags and start loading advertising content with those tags. To combat this, you need an algorithm that can search, rank, and present information in a usable way, says Cattuto. "The new method performs better than anything currently available--spammers rank very low, their content is not exposed, and eventually they stop polluting the system."

The new algorithm is called Spamming-resistant Expertise Analysis and Ranking (SPEAR) and is based on the well-known information-retrieval algorithm called HITS that is used by search engines like Google to rank Web pages. Like HITS, SPEAR is a method of "mutual reinforcement," says Kleinberg. In other words, the algorithm evaluates popular users and popular content and declares expert users to be the ones who identify the most important content, while important content is that which is identified by the most expert users. "The result is a way of identifying both expert users and high-quality content," he says.

Comments

  • A Better Way to Rank Expertise Online
    Ok, so I understand how they show quality creators except for the part of a trendsetter sharing something that ultimately becomes popular.

    So how do they rank popularity ?

    If they rank it by how many times it is open, then this model will fail. If they rank it by how much the item is tagged, this can be spammed as well.
    Rate this comment: 12345

    joelsapp
    08/04/2009
    Posts:6
    Avg Rating:
    4/5

Log In

Forgot your password?     Register »
Advertisement

Videos

The Marcellus Shale Gas Rush
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
Featured Content
Sponsored by:
White Papers

Twelve ways to reduce costs with SQL Server 2008
Find out how to reduce costs and get more efficient

Download

Total Economic Impact of SQL Server 2008 Upgrade
Forrester reports on increasing productivity and management capabilities

Download 

Achieving Cost and Resource Savings with UC
How Office Communications Server R2 and Exchange Server can make your business smarter and more efficient

Download 

The Compelling Case for Conferencing
Read how you can improve workload support and find IT efficiencies

Download

How Windows Server 2008 R2 Helps Optimize IT and Save you Money
Read how you can improve workload support and find IT efficiencies

Download

Windows Server 2008 R2 Hyper-V Live Migration
See how Windows Server 2008 R2 and Hyper-V enable virtualization and Live Migration

Download
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.