Target specific peptide design using latent space approximate trajectory collector

02/02/2023
by   Tong Lin, et al.
0

Despite the prevalence and many successes of deep learning applications in de novo molecular design, the problem of peptide generation targeting specific proteins remains unsolved. A main barrier for this is the scarcity of the high-quality training data. To tackle the issue, we propose a novel machine learning based peptide design architecture, called Latent Space Approximate Trajectory Collector (LSATC). It consists of a series of samplers on an optimization trajectory on a highly non-convex energy landscape that approximates the distributions of peptides with desired properties in a latent space. The process involves little human intervention and can be implemented in an end-to-end manner. We demonstrate the model by the design of peptide extensions targeting Beta-catenin, a key nuclear effector protein involved in canonical Wnt signalling. When compared with a random sampler, LSATC can sample peptides with 36% lower binding scores in a 16 times smaller interquartile range (IQR) and 284% less hydrophobicity with a 1.4 times smaller IQR. LSATC also largely outperforms other common generative models. Finally, we utilized a clustering algorithm to select 4 peptides from the 100 LSATC designed peptides for experimental validation. The result confirms that all the four peptides extended by LSATC show improved Beta-catenin binding by at least 20.0%, and two of the peptides show a 3 fold increase in binding affinity as compared to the base peptide.

READ FULL TEXT
research
03/01/2022

Multi-Objective Latent Space Optimization of Generative Molecular Design Models

Molecular design based on generative models, such as variational autoenc...
research
07/14/2022

Comparing the latent space of generative models

Different encodings of datapoints in the latent space of latent-vector g...
research
09/29/2021

Designing Complex Experiments by Applying Unsupervised Machine Learning

Design of experiments (DOE) is playing an essential role in learning and...
research
06/10/2021

Score-based Generative Modeling in Latent Space

Score-based generative models (SGMs) have recently demonstrated impressi...
research
04/30/2020

Robustness Certification of Generative Models

Generative neural networks can be used to specify continuous transformat...
research
05/23/2019

A COLD Approach to Generating Optimal Samples

Optimising discrete data for a desired characteristic using gradient-bas...
research
06/17/2008

Decoding Beta-Decay Systematics: A Global Statistical Model for Beta^- Halflives

Statistical modeling of nuclear data provides a novel approach to nuclea...

Please sign up or login with your details

Forgot password? Click here to reset