Mapping the space of chemical reactions using attention-based neural networks

Organic reactions are usually assigned to classes containing reactions with similar reagents and mechanisms. Reaction classes facilitate the communication of complex concepts and efficient navigation through chemical reaction space. However, the classification process is a tedious task. It requires...

Full description

Saved in:
Bibliographic Details
Published in:Nature machine intelligence Vol. 3; no. 2; pp. 144 - 152
Main Authors: Schwaller, Philippe, Probst, Daniel, Vaucher, Alain C., Nair, Vishnu H., Kreutter, David, Laino, Teodoro, Reymond, Jean-Louis
Format: Journal Article
Language:English
Published: London Nature Publishing Group UK 01-02-2021
Nature Publishing Group
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Organic reactions are usually assigned to classes containing reactions with similar reagents and mechanisms. Reaction classes facilitate the communication of complex concepts and efficient navigation through chemical reaction space. However, the classification process is a tedious task. It requires identification of the corresponding reaction class template via annotation of the number of molecules in the reactions, the reaction centre and the distinction between reactants and reagents. Here, we show that transformer-based models can infer reaction classes from non-annotated, simple text-based representations of chemical reactions. Our best model reaches a classification accuracy of 98.2%. We also show that the learned representations can be used as reaction fingerprints that capture fine-grained differences between reaction classes better than traditional reaction fingerprints. The insights into chemical reaction space enabled by our learned fingerprints are illustrated by an interactive reaction atlas providing visual clustering and similarity searching. Organic chemical reactions can be divided into classes that allow chemists to use the knowledge they have about optimal conditions for specific reactions in the context of other reactions of similar type. Schwaller et al. present here an efficient method based on transformer neural networks that learns a chemical space in which reactions of a similar class are grouped together.
ISSN:2522-5839
2522-5839
DOI:10.1038/s42256-020-00284-w