DeepAI AI Chat
Log In Sign Up

End-to-end Learning for Short Text Expansion

by   Jian Tang, et al.

Effectively making sense of short texts is a critical task for many real world applications such as search engines, social media services, and recommender systems. The task is particularly challenging as a short text contains very sparse information, often too sparse for a machine learning algorithm to pick up useful signals. A common practice for analyzing short text is to first expand it with external information, which is usually harvested from a large collection of longer texts. In literature, short text expansion has been done with all kinds of heuristics. We propose an end-to-end solution that automatically learns how to expand short text to optimize a given learning task. A novel deep memory network is proposed to automatically find relevant information from a collection of longer documents and reformulate the short text through a gating mechanism. Using short text classification as a demonstrating task, we show that the deep memory network significantly outperforms classical text expansion methods with comprehensive experiments on real world data sets.


page 1

page 2

page 3

page 4


Short Text Topic Modeling Techniques, Applications, and Performance: A Survey

Analyzing short texts infers discriminative and coherent latent topics t...

Context Enhanced Short Text Matching using Clickthrough Data

The short text matching task employs a model to determine whether two sh...

Improving Medical Short Text Classification with Semantic Expansion Using Word-Cluster Embedding

Automatic text classification (TC) research can be used for real-world p...

Deep Short Text Classification with Knowledge Powered Attention

Short text classification is one of important tasks in Natural Language ...

Syntax-based Deep Matching of Short Texts

Many tasks in natural language processing, ranging from machine translat...

Language Detection For Short Text Messages In Social Media

With the constant growth of the World Wide Web and the number of documen...