Many a controversy has raged on social media platforms such as Twitter. Some last for weeks or months, others blow themselves in an afternoon. And yet most go unnoticed by most people. That would change if there was a reliable way of spotting controversies in the Twitterstream in real time.
That could happen thanks to the work of Kiran Garimella and pals at Aalto University in Finland. These guys have found a way to spot the characteristics of a controversy in a collection of tweets and distinguish this from a noncontroversial conversation.
Various researchers have studied controversies on Twitter but these have all focused on preidentified arguments, whereas Garimella and co want to spot them in the first place. Their key idea is that the structure of conversations that involve controversy are different from those that are benign.
And they think this structure can be spotted by studying various properties of the conversation, such as the network of connections between those involved in a topic; the structure of endorsements, who agrees with whom; and the sentiment of the discussion, whether positive and negative.
They test this idea by first studying ten conversations associated with hashtags that are known to be controversial and ten that are known to be benign. Garimella and co map out the structure of these discussion by looking at the networks of retweets, follows, keywords and combinations of these.
These networks allow further study. In particular, the retweet and follow graphs reveal clusters within the networks that might indicate a form of polarization among the users. Sure enough, the networks associated with controversial topics can be clearly partitioned into groups on either side of the debate.
These images (shown above) are beautiful representations of controversy, like fireworks in the Twittersphere. The most controversial are a, b, e, and f which are associated with #beefban, a discussion about the banning of beef in India, and with #russia_march, about protests in Russia.
The others are less controversial: c and g are associated with #sxsw, the tag for the South by South West festival of creativity, and d and h with #germanwings that involved conversations about a dramatic air crash last year but which were not generally controversial.
In all cases, the images clearly show the polarization, or lack of, in the debate.
Garimella and co developed tools for measuring the size of this polarization and hence the controversy. They go on to test these tools on a number of ground truth datasets to see how well they describe controversy.
In general, their methods spot controversy better than existing tools but are by no means perfect. So further work is clearly needed.
But who’d have guessed that controversial topics could look so striking? With tools like this, all Garimella and co have to do is watch the Twitterstream and wait for the fireworks to begin.
Ref: arxiv.org/abs/1507.05224 : Quantifying Controversy in Social Media
It will soon be easy for self-driving cars to hide in plain sight. We shouldn’t let them.
If they ever hit our roads for real, other drivers need to know exactly what they are.
Maximize business value with data-driven strategies
Every organization is now collecting data, but few are truly data driven. Here are five ways data can transform your business.
Cryptocurrency fuels new business opportunities
As adoption of digital assets accelerates, companies are investing in innovative products and services.
Where to get abortion pills and how to use them
New US restrictions could turn abortion into do-it-yourself medicine, but there might be legal risks.
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.