Learning to Make Generalizable and Diverse Predictions for Retrosynthesis

10/21/2019
by   Benson Chen, et al.
9

We propose a new model for making generalizable and diverse retrosynthetic reaction predictions. Given a target compound, the task is to predict the likely chemical reactants to produce the target. This generative task can be framed as a sequence-to-sequence problem by using the SMILES representations of the molecules. Building on top of the popular Transformer architecture, we propose two novel pre-training methods that construct relevant auxiliary tasks (plausible reactions) for our problem. Furthermore, we incorporate a discrete latent variable model into the architecture to encourage the model to produce a diverse set of alternative predictions. On the 50k subset of reaction examples from the United States patent literature (USPTO-50k) benchmark dataset, our model greatly improves performance over the baseline, while also generating predictions that are more diverse.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2017

Retrosynthetic reaction prediction using neural sequence-to-sequence models

We describe a fully data driven model that learns to perform a retrosynt...
research
06/12/2019

A Model to Search for Synthesizable Molecules

Deep generative models are able to suggest new organic molecules by gene...
research
05/23/2018

Predicting Electron Paths

Chemical reactions can be described as the stepwise redistribution of el...
research
06/12/2020

Learning Graph Models for Template-Free Retrosynthesis

Retrosynthesis prediction is a fundamental problem in organic synthesis,...
research
08/31/2020

Future Frame Prediction of a Video Sequence

Predicting future frames of a video sequence has been a problem of high ...
research
10/25/2019

Multimodal Image Outpainting With Regularized Normalized Diversification

In this paper, we study the problem of generating a set ofrealistic and ...
research
05/24/2021

One2Set: Generating Diverse Keyphrases as a Set

Recently, the sequence-to-sequence models have made remarkable progress ...

Please sign up or login with your details

Forgot password? Click here to reset