Augmented Transformer Achieves 97% and 85% for Top5 Prediction of Direct and Classical Retro-Synthesis

03/05/2020
by   Igor V. Tetko, et al.

We investigated the effect of different augmentation scenarios on predicting the (retro)synthesis of chemical compounds using the SMILES representation. We showed that augmenting not only the input sequences but also, importantly, the target data eliminated the effect of data memorization by neural networks and improved their generalization performance for the prediction of new sequences. The Top-5 accuracy for identifying the principal transformation in classical retro-synthesis was 85.4% on the USPTO-50k test dataset, achieved by a combination of SMILES augmentation and beam search. The same approach also outperformed the best published results for the prediction of direct reactions on the USPTO-MIT test set. Our model achieved 90.4% Top-5 accuracy for the USPTO-MIT separated set. The appearance frequency of the most abundantly generated SMILES correlated well with the prediction outcome and can be used as a measure of the quality of a reaction prediction.
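The last point, using the frequency of the most abundantly generated SMILES as a quality measure, can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `vote_on_predictions` and the example strings are hypothetical, and the sketch assumes that each prediction has already been converted to a canonical SMILES string so that identical molecules compare equal.

```python
from collections import Counter


def vote_on_predictions(predictions):
    """Return (most_frequent_smiles, frequency) for a list of predicted SMILES.

    `predictions` holds one predicted string per augmented input (and per
    beam, if beams are pooled). Assumption: the strings are already
    canonicalized, so plain string equality identifies the same molecule.
    """
    if not predictions:
        raise ValueError("need at least one prediction")
    counts = Counter(predictions)
    best, n = counts.most_common(1)[0]
    # Fraction of all predictions that agree with the winner.
    return best, n / len(predictions)


# Hypothetical model outputs for five augmented SMILES of the same input.
preds = ["CCO", "CCO", "CC=O", "CCO", "CCOC"]
best, score = vote_on_predictions(preds)
print(best, score)
```

A score close to 1 means that nearly all augmented inputs led to the same output, which the abstract reports as correlating with a correct prediction; a low score flags a prediction that may deserve less trust.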


