Skip to Content

Study Reveals Techie Terms Censored Online in China

Researchers reverse-engineered a list of keywords blacklisted on various messaging platforms.

China’s surveillance of its citizens’ digital activities is common knowledge. However, questions remain concerning what content is targeted by government censors and how these blacklists change in response to current events.

screenshot of Sina Weibo
Party line: A screenshot of Sina Weibo shows messages posted by a member of the Jiu San Society, a Chinese political party.

A new study released this month in First Monday uncovers more than 4,000 unique keywords censored over the last year and a half on Chinese instant messaging platforms. Focusing on Skype and the microblogging service Sina Weibo, the researchers cultivated their keyword list using reverse-engineering techniques such as packet sniffing, which captures and analyzes packets of data as they pass through a network.

More than 20% of the terms targeted on Sina Weibo, it turns out, related to technology, including specific URLs, spyware, and technical terms. Some of these keywords, such as “Chinese language Wikipedia” and “Google Blogger,” referred to popular websites dedicated to the open dissemination of information. (Censorship of Chinese Wikipedia was first spotted nearly ten years ago on the anniversary of the Tianmen Square protests.)

Generic terms like “system,” “administrator,” and “system notification” also appeared on the keyword list. The researchers hypothesize that these more general keywords might be used to catch anyone attempting to impersonate a Sina Weibo administrator and thus wield power over other users’ accounts.

Other common words targeted by censors include “internet,” “chat,” “world wide web,” and “Chinese person.” Additionally, the researchers found that the censor’s keyword lists fluctuated in response to major events. For example, after the Arab Spring began in late 2010, and some began calling for similar protests (or Jasmine Rallies) in China, dozens of related keywords were added to the censorship lists. 69 of these keywords were then abruptly removed for several weeks in May 2011, which the researchers interpreted as a possible attempt to monitor protester mobilization.

Keep Reading

Most Popular

images created by Google Imagen
images created by Google Imagen

The dark secret behind those cute AI-generated animal images

Google Brain has revealed its own image-making AI, called Imagen. But don't expect to see anything that isn't wholesome.

AGI is just chatter for now concept
AGI is just chatter for now concept

The hype around DeepMind’s new AI model misses what’s actually cool about it

Some worry that the chatter about these tools is doing the whole field a disservice.

Hoan Ton-That, CEO of Clearview AI
Hoan Ton-That, CEO of Clearview AI

The walls are closing in on Clearview AI

The controversial face recognition company was just fined $10 million for scraping UK faces from the web. That might not be the end of it.

spaceman on a horse generated by DALL-E
spaceman on a horse generated by DALL-E

This horse-riding astronaut is a milestone in AI’s journey to make sense of the world

OpenAI’s latest picture-making AI is amazing—but raises questions about what we mean by intelligence.

Stay connected

Illustration by Rose WongIllustration by Rose Wong

Get the latest updates from
MIT Technology Review

Discover special offers, top stories, upcoming events, and more.

Thank you for submitting your email!

Explore more newsletters

It looks like something went wrong.

We’re having trouble saving your preferences. Try refreshing this page and updating them one more time. If you continue to get this message, reach out to us at customer-service@technologyreview.com with a list of newsletters you’d like to receive.