Effects of padding on LSTMs and CNNs

03/18/2019
by Mahidhar Dwarampudi, et al.

Long Short-Term Memory (LSTM) networks and Convolutional Neural Networks (CNNs) have become very common and are used in many fields because they have been effective on many problems where general neural networks were inefficient. They have been applied to a variety of problems, mostly involving images and sequences. Since LSTMs and CNNs take inputs of the same length and dimension, input images and sequences are padded to the maximum length during training and testing. This padding can affect the way the networks function and can make a great deal of difference to performance and accuracy. This paper studies this effect and suggests the best way to pad an input sequence, using a simple sentiment analysis task for this purpose. We use the same dataset on both networks with various padding schemes to show the difference. The paper also discusses some preprocessing techniques applied to the data to ensure effective analysis.
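To make the padding step concrete, here is a minimal Python sketch of padding variable-length token sequences to the maximum length, either at the front ("pre") or at the back ("post"). The function name `pad_sequences`, its parameters, the padding value 0, and the toy token IDs are assumptions made for illustration; they are not taken from the paper, and the sketch does not say which scheme performs better.

```python
from typing import List, Sequence


def pad_sequences(
    seqs: Sequence[Sequence[int]],
    padding: str = "pre",
    value: int = 0,
) -> List[List[int]]:
    """Pad every sequence to the length of the longest one.

    padding="pre" puts the filler before the tokens,
    padding="post" puts it after them.
    """
    if padding not in ("pre", "post"):
        raise ValueError("padding must be 'pre' or 'post'")

    # Length of the longest sequence (0 if the batch is empty).
    max_len = max((len(s) for s in seqs), default=0)

    padded = []
    for s in seqs:
        fill = [value] * (max_len - len(s))
        if padding == "pre":
            padded.append(fill + list(s))
        else:
            padded.append(list(s) + fill)
    return padded


# Hypothetical token IDs for three sentences of different lengths.
tokens = [[5, 8, 2], [7], [3, 9]]
pre = pad_sequences(tokens, padding="pre")
post = pad_sequences(tokens, padding="post")
print(pre)
print(post)
```

With pre-padding the real tokens sit at the end of each row, and with post-padding they sit at the start, so a sequence model reads the filler either before or after the actual content.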


