A Translation Algorithm Can Predict the “Language” of a Chemical Reaction
By thinking of organic chemistry as words and sentences instead of atoms and molecules, researchers have found a way for artificial intelligence to predict chemical reactions.
In a paper published on arXiv by researchers at IBM and being presented at this week’s Neural Information Processing Systems (NIPS) conference, the researchers demonstrate that by treating reaction predictions as a translation problem, they could come up with the correct reaction more often than was possible with previous models.
“Intuitively, there is an analogy between a chemist’s understanding of a compound and a language speaker’s understanding of a word,” the researchers write.
Using a neural network often used in machine translation, the researchers trained the system on a data set that included 395,496 reactions. From that data, the neural net had to learn the “syntax” of reactions to be able to predict unseen compounds. The algorithm gave researchers a list of the top five most likely reactions, and the top prediction was correct 80 percent of the time, beating another model that tried to predict reactions by six percentage points.
There are millions of chemical reactions that have yet to be documented, so this approach could help speed up research for things like drug discovery. But researchers say that as more data gets added to the models, more double-checking will have to take place. Teodoro Laino, one of the researchers, told IEEE Spectrum that they “didn't create this tool to replace organic chemists, but to help them.”
Keep Reading
Most Popular
Large language models can do jaw-dropping things. But nobody knows exactly why.
And that's a problem. Figuring it out is one of the biggest scientific puzzles of our time and a crucial step towards controlling more powerful future models.
The problem with plug-in hybrids? Their drivers.
Plug-in hybrids are often sold as a transition to EVs, but new data from Europe shows we’re still underestimating the emissions they produce.
Google DeepMind’s new generative model makes Super Mario–like games from scratch
Genie learns how to control games by watching hours and hours of video. It could help train next-gen robots too.
How scientists traced a mysterious covid case back to six toilets
When wastewater surveillance turns into a hunt for a single infected individual, the ethics get tricky.
Stay connected
Get the latest updates from
MIT Technology Review
Discover special offers, top stories, upcoming events, and more.