Dilated Convolutions for Modeling Long-Distance Genomic Dependencies

10/03/2017
by   Alexander M. Rush, et al.
0

We consider the task of detecting regulatory elements in the human genome directly from raw DNA. Past work has focused on small snippets of DNA, making it difficult to model long-distance dependencies that arise from DNA's 3-dimensional conformation. In order to study long-distance dependencies, we develop and release a novel dataset for a larger-context modeling task. Using this new data set we model long-distance interactions using dilated convolutional neural networks, and compare them to standard convolutions and recurrent neural networks. We show that dilated convolutions are effective at modeling the locations of regulatory markers in the human genome, such as transcription factor binding sites, histone modifications, and DNAse hypersensitivity sites.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/27/2017

DNA Steganalysis Using Deep Recurrent Neural Networks

The technique of hiding messages in digital data is called a steganograp...
research
02/22/2017

Memory Matching Networks for Genomic Sequence Classification

When analyzing the genome, researchers have discovered that proteins bin...
research
07/13/2019

Multi-Element Long Distance Dependencies: Using SPk Languages to Explore the Characteristics of Long-Distance Dependencies

In order to successfully model Long Distance Dependencies (LDDs) it is n...
research
07/20/2020

i6mA-CNN: a convolution based computational approach towards identification of DNA N6-methyladenine sites in rice genome

Motivation: DNA N6-methylation (6mA) in Adenine nucleotide is a post rep...
research
08/03/2015

Unsupervised Learning in Genome Informatics

With different genomes available, unsupervised learning algorithms are e...
research
12/23/2016

Language Modeling with Gated Convolutional Networks

The pre-dominant approach to language modeling to date is based on recur...
research
10/06/2017

Comparing reverse complementary genomic words based on their distance distributions and frequencies

In this work we study reverse complementary genomic word pairs in the hu...

Please sign up or login with your details

Forgot password? Click here to reset