AI can tell when actors are kissing—and maybe when you are, too
While object recognition in video has advanced rapidly, scene detection, or knowing what’s actually happening on screen, has lagged behind. But being able to analyze and recognize actions in footage could prove useful for applications like video editing. So Amir Ziai, a Stanford student at the time of the research and now a senior data scientist at Netflix, took it upon himself to advance the state of the art, specifically in detecting Hollywood kissing scenes. The study may seem light-hearted or even silly, but it has important implications.
Ziai selected a subset of 100 movies and labeled their kissing and non-kissing scenes, each between 10 and 20 seconds long. He then extracted a still image and an audio sample for every second of each scene, and used them to train a machine-learning algorithm. The resulting model was able to identify which seconds depicted kissing and group them into scenes, achieving a high level of accuracy.
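The final step, turning second-by-second predictions into scenes, can be illustrated with a short Python sketch. It is not Ziai's code, and the article does not describe how his model does the grouping; the function name and the `min_length` and `max_gap` parameters are hypothetical, chosen only to show one plausible approach.

```python
from typing import List, Sequence, Tuple


def group_seconds_into_scenes(
    flags: Sequence[bool],
    min_length: int = 1,
    max_gap: int = 0,
) -> List[Tuple[int, int]]:
    """Merge per-second predictions into (start, end) spans, end exclusive.

    flags[t] is truthy when second t was classified as kissing.
    min_length and max_gap are illustrative knobs, not taken from the study:
    spans shorter than min_length seconds are dropped, and runs separated
    by at most max_gap negative seconds are merged into one span.
    """
    scenes: List[Tuple[int, int]] = []
    start = None      # first positive second of the current span
    last_pos = None   # most recent positive second of the current span

    for t, flag in enumerate(flags):
        if not flag:
            continue
        if start is None:
            start = last_pos = t
        elif t - last_pos - 1 <= max_gap:
            # Close enough to the previous positive second: extend the span.
            last_pos = t
        else:
            # Gap too large: close the current span and start a new one.
            if last_pos - start + 1 >= min_length:
                scenes.append((start, last_pos + 1))
            start = last_pos = t

    # Close the span that is still open at the end of the footage.
    if start is not None and last_pos - start + 1 >= min_length:
        scenes.append((start, last_pos + 1))
    return scenes


# Made-up per-second classifier output for an 11-second clip.
predictions = [0, 1, 1, 1, 0, 1, 1, 0, 0, 0, 1]
print(group_seconds_into_scenes(predictions, min_length=2, max_gap=1))
```

Allowing a small gap keeps a single misclassified second from splitting one scene in two, and the minimum length discards isolated one-second detections.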
The study shows how quickly the means of analyzing footage for specific, even intimate, actions have advanced. Couple that with surveillance footage, and the implications quickly turn Orwellian. In fact, in a new report, the ACLU sounded the alarm on a future in which camera owners would be able to rapidly identify unusual behavior or seek out embarrassing moments. As with deepfakes, this is another case in which technologists should think about the consequences of their work.