Deep generative models of genetic variation capture mutation effects

12/18/2017
by   Adam J. Riesselman, et al.
0

The functions of proteins and RNAs are determined by a myriad of interactions between their constituent residues, but most quantitative models of how molecular phenotype depends on genotype must approximate this by simple additive effects. While recent models have relaxed this constraint to also account for pairwise interactions, these approaches do not provide a tractable path towards modeling higher-order dependencies. Here, we show how latent variable models with nonlinear dependencies can be applied to capture beyond-pairwise constraints in biomolecules. We present a new probabilistic model for sequence families, DeepSequence, that can predict the effects of mutations across a variety of deep mutational scanning experiments significantly better than site independent or pairwise models that are based on the same evolutionary data. The model, learned in an unsupervised manner solely from sequence information, is grounded with biologically motivated priors, reveals latent organization of sequence families, and can be used to extrapolate to new parts of sequence space

READ FULL TEXT

page 4

page 7

page 9

page 11

page 12

page 15

page 16

page 17

research
11/10/2014

Deep Exponential Families

We describe deep exponential families (DEFs), a class of latent variable...
research
10/10/2022

Scaling Up Probabilistic Circuits by Latent Variable Distillation

Probabilistic Circuits (PCs) are a unified framework for tractable proba...
research
12/03/2020

Generative Capacity of Probabilistic Protein Sequence Models

Variational autoencoders (VAEs) have recently gained popularity as gener...
research
03/07/2023

Speech Modeling with a Hierarchical Transformer Dynamical VAE

The dynamical variational autoencoders (DVAEs) are a family of latent-va...
research
02/16/2023

Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits

Probabilistic Circuits (PCs) are a general and unified computational fra...
research
11/12/2019

Purifying Interaction Effects with the Functional ANOVA: An Efficient Algorithm for Recovering Identifiable Additive Models

Recent methods for training generalized additive models (GAMs) with pair...

Please sign up or login with your details

Forgot password? Click here to reset