Improving Chemical Autoencoder Latent Space and Molecular De novo Generation Diversity with Heteroencoders

06/25/2018
by   Esben Jannik Bjerrum, et al.
0

Chemical autoencoders are attractive models as they combine chemical space navigation with possibilities for de novo molecule generation in areas of interest. This enables them to produce focused chemical libraries around a single lead compound so they can be employed early in a drug discovery project. Here it is shown that the choice of chemical representation, such as SMILES strings, has a large influence on the properties of the latent space. It is further explored to what extent translating between different chemical representations influences the latent space similarity to the SMILES strings or circular fingerprints. By employing SMILES enumeration for both the encoder and decoder, it is found that the decoder has the largest influence on the properties of the latent space. Training a sequence to sequence heteroencoder based on recurrent neural networks(RNNs) with long short-term memory cells (LSTM) to predicts different enumerated SMILES strings from the same canonical SMILES string gives the largest similarity between latent space distance and molecular similarity measured as circular fingerprints similarity.

READ FULL TEXT

page 1

page 3

research
11/21/2017

Application of generative autoencoder in de novo molecular design

A major challenge in computational chemistry is the generation of novel ...
research
02/21/2023

CHA2: CHemistry Aware Convex Hull Autoencoder Towards Inverse Molecular Design

Optimizing molecular design and discovering novel chemical structures to...
research
10/18/2020

Characterizing the Latent Space of Molecular Deep Generative Models with Persistent Homology Metrics

Deep generative models are increasingly becoming integral parts of the i...
research
09/29/2020

ChemoVerse: Manifold traversal of latent spaces for novel molecule discovery

In order to design a more potent and effective chemical entity, it is es...
research
08/18/2022

Improving Small Molecule Generation using Mutual Information Machine

We address the task of controlled generation of small molecules, which e...
research
10/19/2022

An efficient graph generative model for navigating ultra-large combinatorial synthesis libraries

Virtual, make-on-demand chemical libraries have transformed early-stage ...
research
09/26/2022

Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs

In principle, applying variational autoencoders (VAEs) to sequential dat...

Please sign up or login with your details

Forgot password? Click here to reset