Skip to Content
Artificial intelligence

Now any business can access the same type of AI that powered AlphaGo

March 6, 2019

A startup called CogitAI has developed a platform that lets companies use reinforcement learning, the technique that gave AlphaGo mastery of the board game Go.

Gaining experience: AlphaGo, an AI program developed by DeepMind, taught itself to play Go by practicing. It’s practically impossible for a programmer to manually code in the best strategies for winning. Instead, reinforcement learning let the program figure out how to defeat the world’s best human players on its own. 

Drug delivery: Reinforcement learning is still an experimental technology, but it is gaining a foothold in industry. Amazon recently launched a reinforcement-learning platform, but it is aimed more at researchers and academics. CogitAI’s first commercial customers include those working in robotics for drug manufacturing. Its platform lets the robot figure out the optimal way to process drug orders.

Brain trust: CogitAI was founded by several smart AI experts, including Peter Stone, a professor at the University of Texas. Rich Sutton, one of the fathers of reinforcement learning, is an advisor.

Learn for life: Stone says CogitAI’s platform is also the first to incorporate the ability to apply what it has learned in one situation to a new one, a first step toward “lifelong learning” for AI programs. “The platform has all of the cutting-edge RL algorithms and then some of our steps toward continual learning,” he says.

For more on the world of AI, sign up here to our twice-weekly AI newsletter, The Algorithm.

Deep Dive

Artificial intelligence

The inside story of how ChatGPT was built from the people who made it

Exclusive conversations that take us behind the scenes of a cultural phenomenon.

AI is dreaming up drugs that no one has ever seen. Now we’ve got to see if they work.

AI automation throughout the drug development pipeline is opening up the possibility of faster, cheaper pharmaceuticals.

GPT-4 is bigger and better than ChatGPT—but OpenAI won’t say why

We got a first look at the much-anticipated big new language model from OpenAI. But this time how it works is even more deeply under wraps.

The original startup behind Stable Diffusion has launched a generative AI for video

Runway’s new model, called Gen-1, can change the visual style of existing videos and movies.

Stay connected

Illustration by Rose Wong

Get the latest updates from
MIT Technology Review

Discover special offers, top stories, upcoming events, and more.

Thank you for submitting your email!

Explore more newsletters

It looks like something went wrong.

We’re having trouble saving your preferences. Try refreshing this page and updating them one more time. If you continue to get this message, reach out to us at customer-service@technologyreview.com with a list of newsletters you’d like to receive.