Short Text Classification Improved by Feature Space Extension

04/02/2019
by   Yanxuan Li, et al.
0

With the explosive development of mobile Internet, short text has been applied extensively. The difference between classifying short text and long documents is that short text is of shortness and sparsity. Thus, it is challenging to deal with short text classification owing to its less semantic information. In this paper, we propose a novel topic-based convolutional neural network (TB-CNN) based on Latent Dirichlet Allocation (LDA) model and convolutional neural network. Comparing to traditional CNN methods, TB-CNN generates topic words with LDA model to reduce the sparseness and combines the embedding vectors of topic words and input words to extend feature space of short text. The validation results on IMDB movie review dataset show the improvement and effectiveness of TB-CNN.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2016

On a Topic Model for Sentences

Probabilistic topic models are generative models that describe the conte...
research
08/28/2020

An Intelligent CNN-VAE Text Representation Technology Based on Text Semantics for Comprehensive Big Data

In the era of big data, a large number of text data generated by the Int...
research
02/09/2020

Short Text Classification via Knowledge powered Attention with Similarity Matrix based CNN

Short text is becoming more and more popular on the web, such as Chat Me...
research
03/15/2017

A Hybrid Supervised-unsupervised Method on Image Topic Visualization with Convolutional Neural Network and LDA

Given the progress in image recognition with recent data driven paradigm...
research
10/18/2018

TS-CNN: Text Steganalysis from Semantic Space Based on Convolutional Neural Network

Steganalysis has been an important research topic in cybersecurity that ...
research
02/27/2018

Convolutional Neural Networks for Toxic Comment Classification

Flood of information is produced in a daily basis through the global Int...
research
08/27/2018

Review Helpfulness Assessment based on Convolutional Neural Network

In this paper we describe the implementation of a convolutional neural n...

Please sign up or login with your details

Forgot password? Click here to reset