Instagram users have used hashtags to classify billions of images, unintentionally creating great labels to train the company’s computer vision algorithms.
The news: Facebook created a data set of 3.5 billion pictures and 17,000 hashtags pulled from public Instagram accounts to improve how well it can recognize objects in images, the company announced on stage at its annual F8 developer conference today. Using a subset of that data set, Facebook was able to label 85.4 percent of photos correctly, the highest level the company has achieved to date.
But: This free labor does have drawbacks: some hashtags like #photooftheday or #tbt—short for throwback Thursday—don’t describe what’s in the image and can confuse the algorithm.
Why it matters: More accurate computer vision could do everything from surface more relevant content to help keep abusive posts off the site. Of course, Mark Zuckerberg has already had to answer for how users’ data gets handled, and not everyone might be happy to find out that their vacation photos are being put to work.
This artist is dominating AI-generated art. And he’s not happy about it.
Greg Rutkowski is a more popular prompt than Picasso.
What does GPT-3 “know” about me?
Large language models are trained on troves of personal data hoovered from the internet. So I wanted to know: What does it have on me?
An AI that can design new proteins could help unlock new cures and materials
The machine-learning tool could help researchers discover entirely new proteins not yet known to science.
DeepMind’s new chatbot uses Google searches plus humans to give better answers
The lab trained a chatbot to learn from human feedback and search the internet for information to support its claims.
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.