Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

02/24/2015
by   Hamid Palangi, et al.
0

This paper develops a model that addresses sentence embedding, a hot topic in current natural language processing research, using recurrent neural networks with Long Short-Term Memory (LSTM) cells. Due to its ability to capture long term memory, the LSTM-RNN accumulates increasingly richer information as it goes through the sentence, and when it reaches the last word, the hidden layer of the network provides a semantic representation of the whole sentence. In this paper, the LSTM-RNN is trained in a weakly supervised manner on user click-through data logged by a commercial web search engine. Visualization and analysis are performed to understand how the embedding process works. The model is found to automatically attenuate the unimportant words and detects the salient keywords in the sentence. Furthermore, these detected keywords are found to automatically activate different cells of the LSTM-RNN, where words belonging to a similar topic activate the same cell. As a semantic representation of the sentence, the embedding vector can be used in many different applications. These automatic keyword detection and topic allocation abilities enabled by the LSTM-RNN allow the network to perform document retrieval, a difficult language processing task, where the similarity between the query and documents can be measured by the distance between their corresponding sentence embedding vectors computed by the LSTM-RNN. On a web search task, the LSTM-RNN embedding is shown to significantly outperform several existing state of the art methods. We emphasize that the proposed model generates sentence embedding vectors that are specially useful for web document retrieval tasks. A comparison with a well known general sentence embedding method, the Paragraph Vector, is performed. The results show that the proposed method in this paper significantly outperforms it for web document retrieval task.

READ FULL TEXT

page 1

page 7

page 16

page 22

page 23

page 24

research
11/01/2016

Recurrent Neural Network Language Model Adaptation Derived Document Vector

In many natural language processing (NLP) tasks, a document is commonly ...
research
06/20/2016

LSTM-Based Predictions for Proactive Information Retrieval

We describe a method for proactive information retrieval targeted at ret...
research
05/07/2015

Jointly Modeling Embedding and Translation to Bridge Video and Language

Automatically describing video content with natural language is a fundam...
research
12/11/2017

A Novel Way of Identifying Cyber Predators

Recurrent Neural Networks with Long Short-Term Memory cell (LSTM-RNN) ha...
research
11/15/2017

A Sequential Neural Encoder with Latent Structured Description for Modeling Sentences

In this paper, we propose a sequential neural encoder with latent struct...
research
07/22/2019

Towards an LSTM-based Predictive Framework for Literature-based Knowledge Discovery

Literature-based knowledge discovery process identifies the important bu...
research
12/03/2015

Effective LSTMs for Target-Dependent Sentiment Classification

Target-dependent sentiment classification remains a challenge: modeling ...

Please sign up or login with your details

Forgot password? Click here to reset