Leap-LSTM: Enhancing Long Short-Term Memory for Text Categorization

05/28/2019
by   Ting Huang, et al.
0

Recurrent Neural Networks (RNNs) are widely used in the field of natural language processing (NLP), ranging from text categorization to question answering and machine translation. However, RNNs generally read the whole text from beginning to end or vice versa sometimes, which makes it inefficient to process long texts. When reading a long document for a categorization task, such as topic categorization, large quantities of words are irrelevant and can be skipped. To this end, we propose Leap-LSTM, an LSTM-enhanced model which dynamically leaps between words while reading texts. At each step, we utilize several feature encoders to extract messages from preceding texts, following texts and the current word, and then determine whether to skip the current word. We evaluate Leap-LSTM on several text categorization tasks: sentiment analysis, news categorization, ontology classification and topic classification, with five benchmark data sets. The experimental results show that our model reads faster and predicts better than standard LSTM. Compared to previous models which can also skip words, our model achieves better trade-offs between performance and efficiency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/23/2017

Learning to Skim Text

Recurrent Neural Networks are showing much promise in many sub-areas of ...
research
03/20/2019

Neural Speed Reading with Structural-Jump-LSTM

Recurrent neural networks (RNNs) can model natural language by sequentia...
research
11/09/2018

Long Short-Term Memory with Dynamic Skip Connections

In recent years, long short-term memory (LSTM) has been successfully use...
research
02/07/2016

Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings

One-hot CNN (convolutional neural network) has been shown to be effectiv...
research
02/19/2016

Contextual LSTM (CLSTM) models for Large scale NLP tasks

Documents exhibit sequential structure at multiple levels of abstraction...
research
04/26/2020

Detect Language of Transliterated Texts

Informal transliteration from other languages to English is prevalent in...
research
02/25/2019

Attentional Encoder Network for Targeted Sentiment Classification

Targeted sentiment classification aims at determining the sentimental te...

Please sign up or login with your details

Forgot password? Click here to reset