RetCL: A Selection-based Approach for Retrosynthesis via Contrastive Learning

05/03/2021
by Hankook Lee, et al.

Retrosynthesis, whose goal is to find a set of reactants for synthesizing a target product, is an emerging research area in deep learning. While existing approaches have shown promising results, they cannot account for the availability (e.g., stability or purchasability) of the reactants, nor do they generalize to unseen reaction templates (i.e., chemical reaction rules). In this paper, we propose a new approach that mitigates these issues by reformulating retrosynthesis as the problem of selecting reactants from a candidate set of commercially available molecules. To this end, we design an efficient reactant selection framework, named RetCL (retrosynthesis via contrastive learning), that enumerates all of the candidate molecules according to selection scores computed by graph neural networks. To learn the score functions, we also propose a novel contrastive training scheme with hard negative mining. Extensive experiments demonstrate the benefits of the proposed selection-based approach. For example, when all 671k reactants in the USPTO database are given as candidates, RetCL achieves a top-1 exact-match accuracy of 71.3% on the USPTO-50k benchmark, whereas a recent transformer-based approach achieves 59.6%. We also demonstrate that RetCL generalizes well to unseen templates in various settings, in contrast to template-based approaches.
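The selection-based formulation and the contrastive objective with hard negative mining can be illustrated with a minimal sketch. This is not the RetCL implementation: the fixed toy vectors stand in for embeddings that a graph neural network would produce, cosine similarity with a temperature-scaled softmax is a generic contrastive loss chosen for illustration, and every function name and parameter below (`rank_candidates`, `hardest_negatives`, `contrastive_loss`, `k`, `temperature`) is hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_candidates(product_emb, candidate_embs):
    """Return candidate indices sorted by descending selection score."""
    scores = [cosine(product_emb, c) for c in candidate_embs]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


def hardest_negatives(product_emb, candidate_embs, positive_idx, k):
    """Pick the k highest-scoring candidates that are not the true reactant."""
    ranked = rank_candidates(product_emb, candidate_embs)
    return [i for i in ranked if i != positive_idx][:k]


def contrastive_loss(product_emb, candidate_embs, positive_idx, k=2, temperature=0.1):
    """Softmax cross-entropy of the positive against its k hardest negatives."""
    negatives = hardest_negatives(product_emb, candidate_embs, positive_idx, k)
    indices = [positive_idx] + negatives
    logits = [cosine(product_emb, candidate_embs[i]) / temperature for i in indices]
    # Numerically stable log-sum-exp over the positive and negative logits.
    peak = max(logits)
    log_z = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_z - logits[0]


# Toy data (assumed for illustration): one product and four candidate reactants.
product = [1.0, 0.0]
candidates = [[0.9, 0.1], [0.0, 1.0], [0.7, 0.7], [-1.0, 0.0]]

ranking = rank_candidates(product, candidates)
hard = hardest_negatives(product, candidates, positive_idx=0, k=2)
loss_good = contrastive_loss(product, candidates, positive_idx=0)
loss_bad = contrastive_loss(product, candidates, positive_idx=3)
print(ranking, hard, loss_good, loss_bad)
```

In this sketch, mining the highest-scoring wrong candidates as negatives concentrates the loss on the molecules most easily confused with the true reactant. The loss is small when the true reactant already ranks first and large when it ranks last.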

research
09/13/2017

Predicting Organic Reaction Outcomes with Weisfeiler-Lehman Network

The prediction of organic reaction outcomes is a fundamental problem in ...
research
06/12/2020

Learning Graph Models for Template-Free Retrosynthesis

Retrosynthesis prediction is a fundamental problem in organic synthesis,...
research
06/09/2021

Self-Improved Retrosynthetic Planning

Retrosynthetic planning is a fundamental problem in chemistry for findin...
research
01/06/2020

Retrosynthesis Prediction with Conditional Graph Logic Network

Retrosynthesis is one of the fundamental problems in organic chemistry. ...
research
07/08/2022

Few-Example Clustering via Contrastive Learning

We propose Few-Example Clustering (FEC), a novel algorithm that performs...
research
08/09/2021

ChemiRise: a data-driven retrosynthesis engine

We have developed an end-to-end, retrosynthesis system, named ChemiRise,...
research
12/07/2017

Chemical Transformation Motifs - Modelling Pathways as Integer Hyperflows

We present an elaborate framework for formally modelling pathways in che...
