DeepDiff: Deep-learning for predicting Differential gene expression from histone modifications

07/10/2018
by   Arshdeep Sekhon, et al.
4

Computational methods that predict differential gene expression from histone modification signals are highly desirable for understanding how histone modifications control the functional heterogeneity of cells through influencing differential gene regulation. Recent studies either failed to capture combinatorial effects on differential prediction or primarily only focused on cell type-specific analysis. In this paper, we develop a novel attention-based deep learning architecture, DeepDiff, that provides a unified and end-to-end solution to model and to interpret how dependencies among histone modifications control the differential patterns of gene regulation. DeepDiff uses a hierarchy of multiple Long short-term memory (LSTM) modules to encode the spatial structure of input signals and to model how various histone modifications cooperate automatically. We introduce and train two levels of attention jointly with the target prediction, enabling DeepDiff to attend differentially to relevant modifications and to locate important genome positions for each modification. Additionally, DeepDiff introduces a novel deep-learning based multi-task formulation to use the cell-type-specific gene expression predictions as auxiliary tasks, encouraging richer feature embeddings in our primary task of differential expression prediction. Using data from Roadmap Epigenomics Project (REMC) for ten different pairs of cell types, we show that DeepDiff significantly outperforms the state-of-the-art baselines for differential gene expression prediction. The learned attention weights are validated by observations from previous studies about how epigenetic mechanisms connect to differential gene expression. Codes and results are available at <deepchrome.org>

READ FULL TEXT

page 1

page 3

page 6

research
08/01/2017

Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin

The past decade has seen a revolution in genomic technologies that enabl...
research
12/15/2020

SimpleChrome: Encoding of Combinatorial Effects for Predicting Gene Expression

Due to recent breakthroughs in state-of-the-art DNA sequencing technolog...
research
09/24/2022

DeepChrome 2.0: Investigating and Improving Architectures, Visualizations, Experiments

Histone modifications play a critical role in gene regulation. Consequen...
research
05/23/2016

Genetic Architect: Discovering Genomic Structure with Learned Neural Architectures

Each human genome is a 3 billion base pair set of encoding instructions....
research
01/29/2021

A principle feature analysis

A key task of data science is to identify relevant features linked to ce...
research
03/17/2023

Breast Cancer Histopathology Image based Gene Expression Prediction using Spatial Transcriptomics data and Deep Learning

Tumour heterogeneity in breast cancer poses challenges in predicting out...
research
10/05/2020

Factorized linear discriminant analysis for phenotype-guided representation learning of neuronal gene expression data

A central goal in neurobiology is to relate the expression of genes to t...

Please sign up or login with your details

Forgot password? Click here to reset