Researchers and companies who want to track the conversations going on online are intensely interested in data from Twitter. It’s been hard to get deep access to that information, however. Onstage today at Defrag, a Web conference in Denver, Colorado, Twitter announced that it’s formed a partnership to make more of its data available for analysis.
Ryan Sarver, a member of Twitter’s platform team, said that the move is aimed at helping people who are analyzing huge bodies of Twitter posts in order to perform sentiment analysis, identify trends, and other sorts of data-intensive tasks. “We haven’t been able to serve that market well in the past,” Sarver said.
Twitter already let people pick up portions of its data for free through several partial feeds, such as the Spritzer, which skims a portion of the posts moving through Twitter at any given moment and passes them on. Before today’s announcement, however, those wanting more had to make deals with Twitter to get more data. Google and Bing, for example, made special agreements to incorporate real-time feeds from Twitter on its search results page.
That data hasn’t been readily available for several reasons. First, it’s valuable and makes up some portion of Twitter’s business model. Second, Twitter already struggles with overload and wouldn’t be able to handle constant requests for its full feed.
Twitter will open up more of its data through a partnership with Gnip, a social data company based in Boulder, Colorado. Gnip will help Twitter distribute the information, minimizing the stress that this places on Twitter’s resources. Twitter is also granting Gnip a license to sell the data.
Gnip is starting out by offering three new feeds: the Twitter halfhose, which gives 50 percent of the full Twitter firehose, the Twitter Decahose, which is 10 percent of the full Twitter stream, and the Mentionhose, which is a full real-time stream of all tweets mentioning a user, including replies and retweets.
“We will provide more transparent, consistent access to Twitter data than has ever been available before,” said Gnip CEO Jud Valeski. He says that all of these new offerings give much more data than was previously available to most people. He expects the Mentionhose to be particularly interesting to companies tracking trends, looking for influential people on Twitter, and monitoring engagement with a product.
Valeski said, “There is insatiable demand for lots of data to understand how conversations online are taking place and transpiring.”