The Synthesizability of Molecules Proposed by Generative Models

02/17/2020
by   Wenhao Gao, et al.
83

The discovery of functional molecules is an expensive and time-consuming process, exemplified by the rising costs of small molecule therapeutic discovery. One class of techniques of growing interest for early-stage drug discovery is de novo molecular generation and optimization, catalyzed by the development of new deep learning approaches. These techniques can suggest novel molecular structures intended to maximize a multi-objective function, e.g., suitability as a therapeutic against a particular target, without relying on brute-force exploration of a chemical space. However, the utility of these approaches is stymied by ignorance of synthesizability. To highlight the severity of this issue, we use a data-driven computer-aided synthesis planning program to quantify how often molecules proposed by state-of-the-art generative models cannot be readily synthesized. Our analysis demonstrates that there are several tasks for which these models generate unrealistic molecular structures despite performing well on popular quantitative benchmarks. Synthetic complexity heuristics can successfully bias generation toward synthetically-tractable chemical space, although doing so necessarily detracts from the primary objective. This analysis suggests that to improve the utility of these models in real discovery workflows, new algorithm development is warranted.

READ FULL TEXT

page 8

page 35

page 36

page 37

research
07/20/2020

Visualizing Deep Graph Generative Models for Drug Discovery

Drug discovery aims at designing novel molecules with specific desired p...
research
09/15/2020

Scaffold-constrained molecular generation

One of the major applications of generative models for drug Discovery ta...
research
02/07/2020

A deep-learning view of chemical space designed to facilitate drug discovery

Drug discovery projects entail cycles of design, synthesis, and testing ...
research
12/01/2022

Re-evaluating sample efficiency in de novo molecule generation

De novo molecule generation can suffer from data inefficiency; requiring...
research
12/20/2021

A Constraint Programming Approach to Weighted Isomorphic Mapping of Fragment-based Shape Signatures

Fragment-based shape signature techniques have proven to be powerful too...
research
01/28/2022

FastFlows: Flow-Based Models for Molecular Graph Generation

We propose a framework using normalizing-flow based models, SELF-Referen...
research
04/05/2022

Generative Enriched Sequential Learning (ESL) Approach for Molecular Design via Augmented Domain Knowledge

Deploying generative machine learning techniques to generate novel chemi...

Please sign up or login with your details

Forgot password? Click here to reset