Effective Use of Word Order for Text Categorization with Convolutional Neural Networks

12/01/2014
by   Rie Johnson, et al.
0

Convolutional neural network (CNN) is a neural network that can make use of the internal structure of data such as the 2D structure of image data. This paper studies CNN on text categorization to exploit the 1D structure (namely, word order) of text data for accurate prediction. Instead of using low-dimensional word vectors as input as is often done, we directly apply CNN to high-dimensional text data, which leads to directly learning embedding of small text regions for use in classification. In addition to a straightforward adaptation of CNN from image to text, a simple but new variation which employs bag-of-word conversion in the convolution layer is proposed. An extension to combine multiple convolution layers is also explored for higher accuracy. The experiments demonstrate the effectiveness of our approach in comparison with state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2020

Combine Convolution with Recurrent Networks for Text Classification

Convolutional neural network (CNN) and recurrent neural network (RNN) ar...
research
01/07/2020

Multimodal Semantic Transfer from Text to Image. Fine-Grained Image Classification by Distributional Semantics

In the last years, image classification processes like neural networks i...
research
03/11/2019

An Innovative Word Encoding Method For Text Classification Using Convolutional Neural Network

Text classification plays a vital role today especially with the intensi...
research
11/06/2017

Convolutional Neural Network Steganalysis's Application to Steganography

This paper presents a novel approach to increase the performance bounds ...
research
07/20/2020

Cephalometric Landmark Regression with Convolutional Neural Networks on 3D Computed Tomography Data

In this paper, we address the problem of automatic three-dimensional cep...
research
11/25/2017

Towards Accurate Deceptive Opinion Spam Detection based on Word Order-preserving CNN

As a mainly network of Internet naval activities, the deceptive opinion ...
research
09/13/2021

Embedding Convolutions for Short Text Extreme Classification with Millions of Labels

Automatic annotation of short-text data to a large number of target labe...

Please sign up or login with your details

Forgot password? Click here to reset