A COLD Approach to Generating Optimal Samples

05/23/2019
by Omar Mahmood, et al.

Optimising discrete data for a desired characteristic using gradient-based methods involves projecting the data into a continuous latent space and carrying out optimisation in this space. Global optimisation is difficult because optimisers are likely to follow gradients into regions of the latent space that the model was not exposed to during training; samples generated from these regions are likely to be too dissimilar to the training data to be useful. We propose Constrained Optimisation with Latent Distributions (COLD), a constrained global optimisation procedure for finding samples that have high values of a desired property and are similar to, yet distinct from, the training data. On MNIST, our procedure yields optima for each of three different objectives, and enforcing tighter constraints improves the quality and increases the diversity of the generated images. On the ChEMBL molecular dataset, our method generates a diverse set of new molecules with drug-likeness scores similar to those of the highest-scoring molecules in the training data. We also demonstrate a computationally efficient way to approximate the constraint when evaluating it exactly is expensive.
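The general idea of constrained optimisation in a latent space can be illustrated with a toy sketch. The abstract does not specify the COLD constraint or optimiser, so everything below is an assumption made for illustration: the constraint is a simple norm ball around the origin (a crude stand-in for "stay where the training latents lie"), the optimiser is projected gradient ascent, and the names `project_to_ball`, `constrained_latent_ascent`, and `grad_fn` are hypothetical. It is not the authors' method.

```python
import numpy as np

def project_to_ball(z, radius):
    """Project z onto the ball ||z|| <= radius (assumed stand-in constraint)."""
    norm = np.linalg.norm(z)
    return z if norm <= radius else z * (radius / norm)

def constrained_latent_ascent(grad_fn, z0, radius, lr=0.1, steps=200):
    """Projected gradient ascent: take a gradient step, then project back."""
    z = project_to_ball(np.asarray(z0, dtype=float), radius)
    for _ in range(steps):
        z = project_to_ball(z + lr * grad_fn(z), radius)
    return z

# Toy objective f(z) = w . z is unbounded, so an unconstrained optimiser
# would drift arbitrarily far from the region covered by training data.
w = np.array([3.0, 4.0])
grad_fn = lambda z: w  # gradient of f(z) = w . z is constant

z_opt = constrained_latent_ascent(grad_fn, np.zeros(2), radius=2.0)
print(z_opt, np.linalg.norm(z_opt))
```

In this toy setting the iterate moves along `w` until it reaches the boundary of the ball and then stays there, at approximately `(1.2, 1.6)`. A real application would replace `grad_fn` with the gradient of a property predictor on the latent code and decode `z_opt` back to a discrete sample.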

