Google and Facebook aren’t the only ones vying to be the standard bearer for the hottest AI technique around. China’s leading Internet search company, Baidu, which is also investing heavily in a popular and powerful machine-learning technology called deep learning, today released some key code that it uses to make this AI software run very efficiently.
Baidu’s code was recently used to build an impressive speech-recognition system called Deep Speech 2. For some short sentences, this system is better than most humans at recognizing speech correctly (see “Baidu’s Deep-Learning System Rivals People at Speech Recognition”). This is an especially useful technology for Baidu, because it offers a better way for the company’s many millions of users to access its services, especially on mobile. Typing Chinese characters on a smartphone is tricky and complex, and many people in China already prefer to use their voice to send short messages or to search the Web for information.
Deep learning allows computers to perform impressive feats, such as transcribing voice or recognizing objects in images almost flawlessly. A large simulated neural network is fed input, such as audio of a certain word or images showing a particular object, and over time this network will “learn” to recognize almost any new example.
Baidu’s code, called Warp-CTC, is essentially an improved implementation of a deep-learning algorithm developed some time ago that’s been designed to run very quickly on the latest computer chips. A startup called Nervana, which offers a deep-learning framework to companies that cannot or do not want to develop their own, is already using Warp-CTC as part of its technology.
Baidu and other tech companies are making the code they use for deep learning freely available (see “Facebook Joins Stampede of Tech Giants Giving Away Artificial Intelligence Technology”) because they are eager to benefit from public research. They also want to encourage researchers and startups that might eventually be acquired to develop machine-learning systems that will be compatible with their own technology.
The key ingredient needed to create powerful AI applications using such software is the data owned by the likes of Facebook, Google, and Baidu, but even some of this data might increasingly be released (see “Giant Yahoo Data Dump Aims to Help Computers Know What You Want”).
10 Breakthrough Technologies 2024
Every year, we look for promising technologies poised to have a real impact on the world. Here are the advances that we think matter most right now.
Scientists are finding signals of long covid in blood. They could lead to new treatments.
Faults in a certain part of the immune system might be at the root of some long covid cases, new research suggests.
AI for everything: 10 Breakthrough Technologies 2024
Generative AI tools like ChatGPT reached mass adoption in record time, and reset the course of an entire industry.
OpenAI teases an amazing new generative video model called Sora
The firm is sharing Sora with a small group of safety testers but the rest of us will have to wait to learn more.
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.