On minimizers and convolutional filters: a partial justification for the unreasonable effectiveness of CNNs in categorical sequence analysis

11/09/2021
by Yun William Yu, et al.

Minimizers and convolutional neural networks (CNNs) are two quite distinct popular techniques that have both been employed to analyze biological sequences. At face value, the methods seem entirely dissimilar. Minimizers use min-wise hashing on a rolling window to extract a single important k-mer feature per window. CNNs start with a wide array of randomly initialized convolutional filters, paired with a pooling operation, and then multiple additional neural layers to learn both the filters themselves and how those filters can be used to classify the sequence. In this manuscript, I demonstrate through a careful mathematical analysis of hash function properties that there are deep theoretical connections between minimizers and convolutional filters – in short, for sequences over a categorical alphabet, random Gaussian initialization of convolutional filters with max-pooling is equivalent to choosing minimizers from a random hash function biased towards more distinct k-mers. This provides a partial explanation for the unreasonable effectiveness of CNNs in categorical sequence analysis.
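The two selection rules described in the abstract can be put side by side in a small sketch. This is an illustration only, not the paper's construction: the function names, the use of SHA-256 as a stand-in for a random hash function, and the toy sequence are all assumptions made here. It shows that max-pooling the output of one Gaussian-initialized filter over a one-hot encoded sequence picks one k-mer per window, just as a minimizer scheme does, with the negated filter score playing the role of the hash value.

```python
import hashlib
import random


def kmers(seq, k):
    """Return every length-k substring of seq, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def hash_kmer(kmer):
    """Stand-in for a random hash (an assumption): an integer derived from SHA-256."""
    return int.from_bytes(hashlib.sha256(kmer.encode()).digest()[:8], "big")


def minimizers(seq, k, w):
    """For each window of w consecutive k-mers, keep the k-mer with the smallest hash."""
    ks = kmers(seq, k)
    return [min(ks[i:i + w], key=hash_kmer) for i in range(len(ks) - w + 1)]


def random_filter(k, alphabet, rng):
    """One convolutional filter: k positions, one i.i.d. Gaussian weight per letter."""
    return [{c: rng.gauss(0.0, 1.0) for c in alphabet} for _ in range(k)]


def filter_score(kmer, filt):
    """Filter response on a one-hot encoded k-mer: one weight is picked per position."""
    return sum(filt[j][c] for j, c in enumerate(kmer))


def max_pooled_kmers(seq, k, w, filt):
    """Max-pool over w consecutive filter outputs and report the winning k-mer."""
    ks = kmers(seq, k)
    return [max(ks[i:i + w], key=lambda km: filter_score(km, filt))
            for i in range(len(ks) - w + 1)]


if __name__ == "__main__":
    seq = "ACGTTGCAACGTAGCT"  # toy sequence, chosen arbitrarily
    filt = random_filter(3, "ACGT", random.Random(0))
    print(minimizers(seq, 3, 4))
    print(max_pooled_kmers(seq, 3, 4, filt))
```

Both functions return one k-mer per window; they differ only in how k-mers are ranked. The sketch does not attempt to show the bias towards more distinct k-mers that the abstract attributes to Gaussian filters.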

research
09/21/2018

Understanding Convolutional Neural Networks for Text Classification

We present an analysis into the inner workings of Convolutional Neural N...
research
11/20/2015

Training CNNs with Low-Rank Filters for Efficient Image Classification

We propose a new method for creating computationally efficient convoluti...
research
01/23/2018

Learning to Prune Filters in Convolutional Neural Networks

Many state-of-the-art computer vision algorithms use large scale convolu...
research
02/27/2019

Modulated binary cliquenet

Although Convolutional Neural Networks (CNNs) achieve effectiveness in v...
research
06/10/2016

Deep CNNs along the Time Axis with Intermap Pooling for Robustness to Spectral Variations

Convolutional neural networks (CNNs) with convolutional and pooling oper...
research
06/11/2015

Spectral Representations for Convolutional Neural Networks

Discrete Fourier transforms provide a significant speedup in the computa...
