Google Researchers Have a New Alternative to Traditional Neural Networks
Say hello to the capsule network.
AI has enjoyed huge growth in the past few years, and much of that success is owed to deep neural networks, which provide the smarts behind impressive tricks like image recognition. But there is growing concern that some of the fundamental principles that have made those systems so successful may not be able to overcome the major problems facing AI—perhaps the biggest of which is a need for huge quantities of data from which to learn (for a deep dive on this, check out our feature "Is AI Riding a One-Trick Pony?").
Google’s Geoff Hinton appears to be among those fretting about AI's future. As Wired reports, Hinton has unveiled a new take on traditional neural networks that he calls capsule networks. In a pair of new papers—one published on the arXIv, the other on OpenReview—Hinton and a handful of colleagues explain how they work.
Their approach uses small groups of neurons, collectively known as capsules, which are organized into layers to identify things in video or images. When several capsules in one layer agree on having detected something, they activate a capsule at a higher level—and so on, until the network is able to make a judgment about what it sees. Each of those capsules is designed to detect a specific feature in an image in such a way that it can recognize them in different scenarios, like from varying angles.
Hinton claims that the approach, which has been in the making for decades, should enable his networks to recognize objects in new situations with less data than regular neural nets must use.
In the papers published so far, capsule networks have been shown to keep up with regular neural networks when it comes to identifying handwritten characters, and they make fewer errors when trying to recognize previously observed toys from different angles. But for the moment, at least, they're still a bit slower than their traditional counterparts.
Now, then, comes the interesting part. Will these systems provide a compelling alternative to traditional neural networks, or will they stall? We can expect the machine-learning community to implement the work, and fast, in order to find out. Either way, those concerned about the limitations of current AI systems can be heartened by the fact that researchers are pushing the boundaries to build new deep-learning alternatives.
Deep Dive
Artificial intelligence
The inside story of how ChatGPT was built from the people who made it
Exclusive conversations that take us behind the scenes of a cultural phenomenon.
ChatGPT is about to revolutionize the economy. We need to decide what that looks like.
New large language models will transform many jobs. Whether they will lead to widespread prosperity or not is up to us.
GPT-4 is bigger and better than ChatGPT—but OpenAI won’t say why
We got a first look at the much-anticipated big new language model from OpenAI. But this time how it works is even more deeply under wraps.
Google just launched Bard, its answer to ChatGPT—and it wants you to make it better
Under pressure from its rivals, Google is updating the way we look for information by introducing a sidekick to its search engine.
Stay connected
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.