Google and Facebook aren’t the only ones vying to be the standard bearer for the hottest AI technique around. China’s leading Internet search company, Baidu, which is also investing heavily in a popular and powerful machine-learning technology called deep learning, today released some key code that it uses to make this AI software run very efficiently.
Baidu’s code was recently used to build an impressive speech-recognition system called Deep Speech 2. For some short sentences, this system is better than most humans at recognizing speech correctly (see “Baidu’s Deep-Learning System Rivals People at Speech Recognition”). This is an especially useful technology for Baidu, because it offers a better way for the company’s many millions of users to access its services, especially on mobile. Typing Chinese characters on a smartphone is tricky and complex, and many people in China already prefer to use their voice to send short messages or to search the Web for information.
Deep learning allows computers to perform impressive feats, such as transcribing voice or recognizing objects in images almost flawlessly. A large simulated neural network is fed input, such as audio of a certain word or images showing a particular object, and over time this network will “learn” to recognize almost any new example.
Baidu’s code, called Warp-CTC, is essentially an improved implementation of a deep-learning algorithm developed some time ago that’s been designed to run very quickly on the latest computer chips. A startup called Nervana, which offers a deep-learning framework to companies that cannot or do not want to develop their own, is already using Warp-CTC as part of its technology.
Baidu and other tech companies are making the code they use for deep learning freely available (see “Facebook Joins Stampede of Tech Giants Giving Away Artificial Intelligence Technology”) because they are eager to benefit from public research. They also want to encourage researchers and startups that might eventually be acquired to develop machine-learning systems that will be compatible with their own technology.
The key ingredient needed to create powerful AI applications using such software is the data owned by the likes of Facebook, Google, and Baidu, but even some of this data might increasingly be released (see “Giant Yahoo Data Dump Aims to Help Computers Know What You Want”).
This startup wants to copy you into an embryo for organ harvesting
With plans to create realistic synthetic embryos, grown in jars, Renewal Bio is on a journey to the horizon of science and ethics.
VR is as good as psychedelics at helping people reach transcendence
On key metrics, a VR experience elicited a response indistinguishable from subjects who took medium doses of LSD or magic mushrooms.
This artist is dominating AI-generated art. And he’s not happy about it.
Greg Rutkowski is a more popular prompt than Picasso.
This nanoparticle could be the key to a universal covid vaccine
Ending the covid pandemic might well require a vaccine that protects against any new strains. Researchers may have found a strategy that will work.
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.