Retrieval-based Controllable Molecule Generation

08/23/2022
by Zichao Wang, et al.

Generating new molecules with specified chemical and biological properties via generative models has emerged as a promising direction for drug discovery. However, existing methods require extensive training or fine-tuning on a large dataset, which is often unavailable in real-world generation tasks. In this work, we propose a new retrieval-based framework for controllable molecule generation. We use a small set of exemplar molecules, i.e., those that (partially) satisfy the design criteria, to steer a pre-trained generative model towards synthesizing molecules that satisfy the given design criteria. We design a retrieval mechanism that retrieves the exemplar molecules and fuses them with the input molecule; it is trained with a new self-supervised objective that predicts the nearest neighbor of the input molecule. We also propose an iterative refinement process that dynamically updates the generated molecules and the retrieval database for better generalization. Our approach is agnostic to the choice of generative model and requires no task-specific fine-tuning. On various tasks, ranging from simple design criteria to a challenging real-world scenario of designing lead compounds that bind to the SARS-CoV-2 main protease, we demonstrate that our approach extrapolates well beyond the retrieval database and achieves better performance and wider applicability than previous methods.
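
The retrieve, fuse, and refine loop described above can be illustrated with a minimal toy sketch. It is not the authors' implementation: plain numeric vectors stand in for molecule embeddings, Gaussian perturbation stands in for sampling from the pre-trained generative model, and the function names (`retrieve`, `fuse`, `refine`), the softmax-weighted fusion rule, and the database update rule are all assumptions made for illustration.

```python
import math
import random


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def retrieve(query, database, k):
    """Return the k database entries most similar to the query."""
    return sorted(database, key=lambda e: cosine(query, e), reverse=True)[:k]


def fuse(query, exemplars, temperature=1.0):
    """Blend the query with a softmax-weighted average of the exemplars.

    Assumed fusion rule for illustration only.
    """
    scores = [cosine(query, e) / temperature for e in exemplars]
    top = max(scores)
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    blended = [
        sum(w * e[i] for w, e in zip(weights, exemplars)) / total
        for i in range(len(query))
    ]
    return [0.5 * q + 0.5 * b for q, b in zip(query, blended)]


def refine(query, database, score, rounds=5, k=3, samples=8, noise=0.1, seed=0):
    """Iteratively update the current best candidate and the retrieval database."""
    rng = random.Random(seed)
    database = list(database)  # copy so the caller's list is not mutated
    best = list(query)
    for _ in range(rounds):
        fused = fuse(best, retrieve(best, database, k))
        # Stand-in for the generative model: sample around the fused vector.
        candidates = [
            [x + rng.gauss(0.0, noise) for x in fused] for _ in range(samples)
        ]
        top = max(candidates, key=score)
        if score(top) > score(best):
            best = top
            database.append(top)  # assumed update rule: keep improved candidates
    return best, database


if __name__ == "__main__":
    target = [1.0, 1.0]  # hypothetical design criterion: get close to this point
    score = lambda v: -sum((x - t) ** 2 for x, t in zip(v, target))
    exemplars = [[0.8, 0.2], [0.2, 0.8], [0.5, 0.5]]
    best, updated = refine([0.0, 0.1], exemplars, score)
    print(best, len(updated))
```

In this sketch the generator itself is never retrained; only the retrieved exemplars and the database change between rounds, which mirrors the abstract's claim that no task-specific fine-tuning is needed.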


research
01/18/2018

Multi-Objective De Novo Drug Design with Conditional Graph Generative Model

Recently, deep generative models have revealed itself as a promising way...
research
09/15/2020

Scaffold-constrained molecular generation

One of the major applications of generative models for drug Discovery ta...
research
02/08/2020

Composing Molecules with Multiple Property Constraints

Drug discovery aims to find novel compounds with specified chemical prop...
research
03/26/2018

Fréchet ChemblNet Distance: A metric for generative models for molecules

The new wave of successful generative models in machine learning has inc...
research
11/04/2022

De novo PROTAC design using graph-based deep generative models

PROteolysis TArgeting Chimeras (PROTACs) are an emerging therapeutic mod...
research
04/12/2021

Boltzmann Tuning of Generative Models

The paper focuses on the a posteriori tuning of a generative model in or...
research
09/14/2022

A Transfer Function Design Using A Knowledge Database based on Deep Image and Primitive Intensity Profile Features Retrieval

Transfer function (TF) plays a key role for the generation of direct vol...
