Technology Review - Published By MIT
Log in to My.TechnologyReview.com | Register
Advertisement
« Back 1 [2]

Friday, December 01, 2006

What Comes After Web 2.0?

Continued from page 1

By Wade Roush

smaller text tool iconmedium text tool iconlarger text tool icon

In this way, the Piggy Bank researchers hope, Web users can begin to get a taste of the Semantic Web in action, without having to wait for the authors of the billions of documents on the Web to create metadata. The curious can download a Piggy Bank extension for the Firefox Web browser; once the extension is installed, users can choose from a number of "screenscrapers" that extract information from specific sites like LinkedIn and Flickr (a popular photo-sharing site). Piggy Bank stores this "pure information," such as photos or contact names, inside the Web browser in RDF format, theoretically allowing users to mix data from independent sources to create their own "instant mashups" similar to the LinkedIn-Google Maps example.

Unfortunately, there aren't yet any tools that make it easy for nonprogrammers to reuse the RDF data in such mashups. And in my own tests of Piggy Bank, the screenscrapers failed to activate. I'm sure that's because I missed something in the instructions--but the problem does illustrate how much more work is needed before such tools will be ready for public consumption.

A second category of post-Web 2.0 projects focuses not on helping machines understand the meaning and the uses of existing Web content, but on recruiting real people to add their intelligence to information before it's used. The best known example is Amazon Mechanical Turk, a kind of high-tech temp agency introduced by the online retailer in 2005. The service allows people with tasks and questions that computers can't handle--for example, spotting inappropriate images in a collection of photos--to hire other Web users to help.

The employment is extremely temporary--less than an hour per task, in most cases--and the pay is ridiculously low: solutions typically earn the worker only a few cents. But the point isn't to provide Internet addicts with a second income: it's to harness users' brainpower for a few spare moments to carry out simple tasks that remain far beyond the capabilities of artificial-intelligence software. (In fact, Amazon calls its project a form of "artificial artificial intelligence.")

Some tasks are really marketing or product research in disguise. One questioner, for example, asks, "What would make your e-mail better?" Others offer better illustrations of the logic behind breaking up a big data-classification task and distributing it to hundreds of people. One task, apparently from someone trying to make it possible to share information between various Yellow Pages-style directories, asks users to match categories from one directory--say, "Delicatessens"--with the closest equivalents in another--for example, "Delis" or "Small Restaurants." A computer couldn't tackle such a task without years of training on the mundane facts of human existence, such as the fact that a delicatessen is indeed one form of a small restaurant. A human, however, can find the right matches in seconds.

Another project that attempts to persuade humans to add meaning to raw data is the Google Image Labeler. It entices users to label digital photographs according to their content by making the task into a simple game in which contestants must both collaborate and compete. Like Amazon Mechanical Turk, the Image Labeler has a community of fans who enjoy it as a game. And there's nothing wrong with making potentially dull tasks entertaining, if that's what it takes to motivate "workers." But the Image Labeler and the Mechanical Turk will have to grow beyond their toylike demonstration stage before they have a real impact on the Web's usability.

It's not surprising that observers are reaching for new labels to describe the work going on beyond the boundaries of today's Web 2.0. But most of these projects are so far from producing practical tools--let alone services that could be commercialized--that it's premature to say they represent a "third generation" of Web technology. For that, judging from today's state of the art, we'll need to wait another few years.

« Back 1 [2]

Comments

  • FOAF: Are we ready?
    grausc01 on 12/01/2006 at 9:37 AM
    Posts:
    12
    Avg Rating:
    4/5
    I'm a young guy and am often ready to give out information about myself to a computer system in order for others to be able to find me.  When I talk to my mother about what I do, she cries invasion of privacy and that all this data collecting is only leading to a surveillance state.  Having worked with databases and surveys to reach people for marketing purposes I understand how useful it can be to have lots of exact information.  But until people are willing to feel that nothing is private (except account numbers, SS#, etc), trying to link everyone together will be a long way off.  I think it would be more important to work on FOAF-like meta data for products so that one location links together instruction manuals, customer service, updates, user reviews, known problems, and price comparisons all in one place; not like how we have websites with unique information which aren't interconnected in any manner to provide a comprehensive view of a product.
    Rate this comment: 12345
    • Re: FOAF: Are we ready?
      hosro59@comcast.net on 12/01/2006 at 11:54 AM
      Posts:
      6
      We are already surveilled. Would metadata provide even richer
      personal data files, as well as subject data we are interested in?

      I would like that there be protection of personal data.  But, there seems to be as much genious subverting any protection as there is developed.

      Would a Web 2.0 or 3.0 be one that would have a means to include or exclude commercial data, ads?
      Rate this comment: 12345
    • Re: FOAF: Are we ready?
      lund1967 on 12/01/2006 at 1:05 PM
      Posts:
      5
      Avg Rating:
      2/5
      I agree.  Having all of that information in one convenient location is just asking to be hacked.  And because the database does not store SSN's or credit card numbers, security may not be as high a priority as it ought.  I don't think I would use anything like this.
      Rate this comment: 12345
    • Re: FOAF: Are we ready?
      rckthms62 on 12/04/2006 at 2:13 PM
      Posts:
      1
      Privacy is a good point, and I am very worried about it, but we will have to find that balance between privacy and meta-data access. We do need browsers that can give us perceived and accurate preferences without us really asking for it. The internet is filled with information that is too “all-over-the-place”. We need a more logical system that can better understand search request. Mashup is one way to achieve this lack of relevant information gathering. New systems like the Chamberecommerce.com website is a great example in the Mashup concept. It takes everything you need to create an ecommerce website and places it all in one package, eliminating the need of outsourcing such services as PayPal functions. The future will be filled with meta-data and mashup products and functions.     
      Rate this comment: 12345
  • Piggy Bank Firefox Extension?
    rjstone on 12/01/2006 at 1:29 PM
    Posts:
    2
    Does anyone have any more detail about what to search for on addons.mozilla.org for a Piggy Bank Firefox extension?  I tried "piggy bank", "screenscraper", "scraper", "mashup", among others, and didn't get anything. 
    Rate this comment: 12345
  • We're doing it already
    Wonderkid on 12/01/2006 at 3:11 PM
    Posts:
    2
    Drop us a line at owonder.com/contact and we'll let you know when our FOAF/RDF powered service is ready to rock and roll.
    Rate this comment: 12345
  • Meta Validation
    dtiffany on 12/03/2006 at 3:20 PM
    Posts:
    1
    We can add all of the metadata and/or tags we want to web resources but that does not mean that the "data about the data" honestly or accurately describes the resource. This is why Google does not place much importance on the metadata already contained in html document headers in terms of search ranking etc. Ensuring or verifying the validity of metadata would be a task equal to that of actually creating it but would have to be repeated continually.
    Rate this comment: 12345
  • Would Google + Facebook = Web 2.2 ??
    cullin on 12/03/2006 at 4:52 PM
    Posts:
    1
    Wade Roush writes that his "current Web calendar, for example, knows very little about me, except that I have appointments today at 8:30 A.M. and 4:00 P.M. A Semantic Web calendar would not only know my name, but would also have a store of standardized metadata about me, such as "lives in: Las Vegas," "born in: 1967," "likes to eat: Thai food," "belongs to: Stonewall Democrats," and "favorite TV show: Battlestar Galactica."

    As I read this passage I looked up to the tabs along the top of my browser and noticed I still had FaceBook.com open ... and it occured to me that a similar set of factoids is already stored in my FaceBook profile (see http://humber.facebook.com/profile.php?id=507133544) ... To wit: "lives in: Mississauga," "born in: 1967," "likes to eat: Italian food," "favorite TV show: The Daily Show," and political views: "Libertarian".

    More to the point FaceBook already allows for the type of social searching that Wade wishes his calendar was capable of. For instance with one click on my profile I was able to determine the names of 357 people within the Toronto sub grouping of FaceBook who share my interest in the band Rush.

    As I thought about Wade's argument I started to imaging the fusion between FaceBook's profile comparison engine and Google Calendar. As I thought about it more it occurs to me that among the biggest deficiencies of the Google Apps suite is the lack of a FOAF (Friend of a Friend)  capability. And yet in the tab adjacent to my Gmail window sits FaceBook ... almost screaming "Buy me! Merge my capabilities with yours!"

    Piggy Bank is an interesting idea. But unless FireFox 2.1 ships with the extension preloaded it is little more then another geek tool.

    I guess my point in posting this message is this: Google - consider buying FaceBook.com ... and to Mark Zuckerberg (founder of FaceBook) - The price you extract for FaceBook is just the first phase in your road to riches. Which begs the question Mark: Who can leverage (and monetize) the full power of FaceBook over the next 20 years - Google, Yahoo or Microsoft? Put another way: How much would $1 billion in Google stock be worth 20 years from now? How much would $1 billion in Yahoo stock be worth 20 years from now? How much would $1 billion in Microsoft stock be worth 20 years from now?
    Rate this comment: 12345
  • Web 2.0 deep into trusted environments..
    vikrant.goswami on 12/04/2006 at 1:22 AM
    Posts:
    1
    Agree that web 2.0 has not made any impact yet on the Internet, but just happen to look at the more trusted corporate environments .
    Web 2.0 technologies is being used extensively by knowledge management solution vendors like Kanisa  (bought over long time market leaders in Service Resolution Management space i.e. Serviceware), not known as Knova software. With companies like Ontoprise and Cerebra raising huge VC funding and projecting web 2.0 as enabling EAI platform it sure has gret potential. Not to add that giants like  Oracle have integrated Web 2.0 i.e. RDF capabilities and inferencing into Oracle 10g Spatial DB suites. Thus web 2.0 is slowly but surely making inroads into the industry. Not to mention that Google dabbles into Semantic web  Look at
    http://blogs.smh.com.au/mashup/archives//004396.html

    Web 2.0 technologies would allow automatic creationg of meta-data from existing repositories with no meta-data .

    Not sure how long it would somebody take the untrusted environment (heard of Semantic mediation to bride the meta-data gap between two representations of the same domain) of the internet to come up with a commercially viable solutions.....

    But with such movement in the industry, Web 2.0 is sure to break in into Internet , sooner than later
    Rate this comment: 12345
  • Well-tagged content is available for medicine
    kgilpin on 06/11/2007 at 2:53 PM
    Posts:
    1
    For example, the National Library of Medicine has extensively tagged the scientific and medical literature. Some tags are redundant with information that can be gleaned from the title or abstract of the article. However, some tags capture meta-properties such as the intended audience of a paper that would be very hard to detect automatically.

    Curbside.MD http://www.curbside.md is a medical search engine that makes extensive use of concept recognition and tagging.
    Rate this comment: 12345
Advertisement

Current Issue

Technology Review July/August 2008
The Business of Social Networks
The future of the Web is social. But can social-networking sites ever make money?
•  Subscribe
Save 41%
•  Table of Contents
•  MIT News

Magazine Services

Career Resources

MIT Technology Insider

Stories and breaking news from inside MIT about the latest research, innovations, and startups--in a convenient monthly e-newsletter. Subscribe today
Advertisement

More Technology News from Forbes

Advertisement
Advertisement
Advertisement
TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology