Sequence Modeling with Multiresolution Convolutional Memory

05/02/2023
by   Jiaxin Shi, et al.
0

Efficiently capturing the long-range patterns in sequential data sources salient to a given task – such as classification and generative modeling – poses a fundamental challenge. Popular approaches in the space tradeoff between the memory burden of brute-force enumeration and comparison, as in transformers, the computational burden of complicated sequential dependencies, as in recurrent neural networks, or the parameter burden of convolutional networks with many or large filters. We instead take inspiration from wavelet-based multiresolution analysis to define a new building block for sequence modeling, which we call a MultiresLayer. The key component of our model is the multiresolution convolution, capturing multiscale trends in the input sequence. Our MultiresConv can be implemented with shared filters across a dilated causal convolution tree. Thus it garners the computational advantages of convolutional networks and the principled theoretical motivation of wavelet decompositions. Our MultiresLayer is straightforward to implement, requires significantly fewer parameters, and maintains at most a 𝒪(Nlog N) memory footprint for a length N sequence. Yet, by stacking such layers, our model yields state-of-the-art performance on a number of sequence classification and autoregressive density estimation tasks using CIFAR-10, ListOps, and PTB-XL datasets.

READ FULL TEXT

page 4

page 16

research
10/31/2021

Efficiently Modeling Long Sequences with Structured State Spaces

A central goal of sequence modeling is designing a single principled mod...
research
12/28/2017

PixelSNAIL: An Improved Autoregressive Generative Model

Autoregressive generative models consistently achieve the best results i...
research
02/18/2019

STCN: Stochastic Temporal Convolutional Networks

Convolutional architectures have recently been shown to be competitive o...
research
01/06/2022

Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks

Classification of long sequential data is an important Machine Learning ...
research
03/04/2018

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

For most deep learning practitioners, sequence modeling is synonymous wi...
research
01/19/2018

EffNet: An Efficient Structure for Convolutional Neural Networks

With the ever increasing application of Convolutional Neural Networks to...
research
02/04/2021

CKConv: Continuous Kernel Convolution For Sequential Data

Conventional neural architectures for sequential data present important ...

Please sign up or login with your details

Forgot password? Click here to reset