Efficient Text Classification Using Tree-structured Multi-linear Principal Component Analysis

01/20/2018
by Yuanhang Su, et al.

This work proposes a novel dimension reduction technique for text data, called tree-structured multi-linear principal component analysis (TMPCA). Unlike traditional text dimension reduction methods, which operate on word-level representations, TMPCA reduces the dimension of entire input sequences and sentences to simplify the subsequent text classification task. It is shown mathematically and experimentally that TMPCA has much lower complexity (and hence requires less computing power) than ordinary principal component analysis (PCA). Furthermore, experimental results demonstrate that a support vector machine (SVM) applied to TMPCA-processed data achieves performance comparable to or better than the state-of-the-art recurrent neural network (RNN) approach.
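The abstract does not spell out the procedure, so the following is only a rough sketch of what a tree-structured PCA reduction over a sequence might look like, not the authors' implementation. It assumes each sentence is already a fixed-length sequence of word vectors whose length is a power of two, and that each stage concatenates neighbouring vectors in pairs and projects each pair back to the original vector size with PCA. The function name `tree_pca_reduce` and the input layout are hypothetical.

```python
import numpy as np


def tree_pca_reduce(X):
    """Reduce each sequence of vectors to a single vector, stage by stage.

    X has shape (n_samples, seq_len, dim); seq_len must be a power of two.
    Returns an array of shape (n_samples, dim).
    Assumption: this pairing scheme is an illustration, not the paper's code.
    """
    n, seq_len, dim = X.shape
    if seq_len < 1 or seq_len & (seq_len - 1):
        raise ValueError("seq_len must be a power of two")
    while seq_len > 1:
        # Concatenate each pair of neighbouring vectors: (n, seq_len/2, 2*dim).
        pairs = X.reshape(n, seq_len // 2, 2 * dim)
        flat = pairs.reshape(-1, 2 * dim)
        flat = flat - flat.mean(axis=0)
        # PCA on a small (2*dim x 2*dim) covariance matrix at every stage.
        cov = flat.T @ flat / flat.shape[0]
        _, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
        top = eigvecs[:, -dim:]           # the dim leading components
        X = (flat @ top).reshape(n, seq_len // 2, dim)
        seq_len //= 2
    return X[:, 0, :]
```

Under these assumptions, every stage diagonalises only a 2·dim by 2·dim covariance matrix, whereas ordinary PCA on the flattened sequence would need a (seq_len·dim) by (seq_len·dim) one, which is one way to read the lower-complexity claim. The resulting fixed-size vectors could then be passed to any standard classifier such as an SVM.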

research
07/22/2018

On Tree-structured Multi-stage Principal Component Analysis (TMPCA) for Text Classification

A novel sequence-to-vector (seq2vec) embedding method, called the tree-s...
research
11/09/2017

Dimension Reduction of High-Dimensional Datasets Based on Stepwise SVM

The current study proposes a dimension reduction method, stepwise suppor...
research
10/04/2019

A Comparison Study on Nonlinear Dimension Reduction Methods with Kernel Variations: Visualization, Optimization and Classification

Because of high dimensionality, correlation among covariates, and noise ...
research
01/13/2019

Image retrieval method based on CNN and dimension reduction

An image retrieval method based on convolution neural network and dimens...
research
06/11/2020

A multi-objective-based approach for Fair Principal Component Analysis

In dimension reduction problems, the adopted technique may produce dispa...
research
12/04/2017

A text-independent speaker verification model: A comparative analysis

The most pressing challenge in the field of voice biometrics is selectin...
