GEN: Highly Efficient SMILES Explorer Using Autodidactic Generative Examination Networks

09/10/2019
by Ruud van Deursen, et al.

Recurrent neural networks have been widely used to generate millions of de novo molecules in a known chemical space. These deep generative models are typically set up with LSTM or GRU units and trained with canonical SMILES. In this study, we introduce a new robust architecture, Generative Examination Networks (GEN), based on bidirectional RNNs with concatenated sub-models to learn and generate molecular SMILES within a trained target space. GENs autonomously learn the target space in a few epochs while being subjected to an independent online examination mechanism to measure the quality of the generated set. Here we have used online statistical quality control (SQC) on the percentage of valid molecular SMILES as an examination measure to select the earliest available stable model weights. Very high levels of valid SMILES (95-98%) can be generated using multiple parallel encoding layers in combination with SMILES augmentation using unrestricted SMILES randomization. Our architecture combines an excellent novelty rate (85-90%) with conservation of the property space (95-99%) and is open to other quality criteria.
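
The examination idea in the abstract, selecting the earliest stable model weights from the percentage of valid SMILES measured during training, can be illustrated with a minimal sketch. The abstract does not state the actual SQC rules, so everything here is an assumption made for illustration: the function name `earliest_stable_epoch`, the window length, the validity threshold, the allowed spread, and the example percentages are all hypothetical.

```python
from typing import Optional, Sequence


def earliest_stable_epoch(
    validity: Sequence[float],
    window: int = 3,
    min_valid: float = 95.0,
    max_spread: float = 1.0,
) -> Optional[int]:
    """Return the 0-based index of the first epoch that closes a stable window.

    Assumed stability rule (not taken from the paper): the last `window`
    validity percentages are all at least `min_valid`, and the gap between
    the largest and smallest of them is at most `max_spread`.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    for end in range(window - 1, len(validity)):
        recent = validity[end - window + 1 : end + 1]
        if min(recent) >= min_valid and max(recent) - min(recent) <= max_spread:
            return end
    return None  # no stable window was observed


# Hypothetical percentages of valid SMILES measured after each epoch.
history = [62.0, 88.5, 94.0, 96.2, 96.8, 96.5, 97.0]
print(earliest_stable_epoch(history))  # prints 5
```

In a real training loop the per-epoch percentage would come from parsing a freshly generated sample of SMILES with a cheminformatics toolkit, and the weights saved at the returned epoch would be the ones kept.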


