Empirical comparison between autoencoders and traditional dimensionality reduction methods

03/08/2021
by   Quentin Fournier, et al.
0

In order to process efficiently ever-higher dimensional data such as images, sentences, or audio recordings, one needs to find a proper way to reduce the dimensionality of such data. In this regard, SVD-based methods including PCA and Isomap have been extensively used. Recently, a neural network alternative called autoencoder has been proposed and is often preferred for its higher flexibility. This work aims to show that PCA is still a relevant technique for dimensionality reduction in the context of classification. To this purpose, we evaluated the performance of PCA compared to Isomap, a deep autoencoder, and a variational autoencoder. Experiments were conducted on three commonly used image datasets: MNIST, Fashion-MNIST, and CIFAR-10. The four different dimensionality reduction techniques were separately employed on each dataset to project data into a low-dimensional space. Then a k-NN classifier was trained on each projection with a cross-validated random search over the number of neighbours. Interestingly, our experiments revealed that k-NN achieved comparable accuracy on PCA and both autoencoders' projections provided a big enough dimension. However, PCA computation time was two orders of magnitude faster than its neural network counterparts.

READ FULL TEXT
research
08/25/2016

Comparison among dimensionality reduction techniques based on Random Projection for cancer classification

Random Projection (RP) technique has been widely applied in many scenari...
research
11/28/2018

A Visual Interaction Framework for Dimensionality Reduction Based Data Exploration

Dimensionality reduction is a common method for analyzing and visualizin...
research
08/15/2022

On a Mechanism Framework of Autoencoders

This paper proposes a theoretical framework on the mechanism of autoenco...
research
10/12/2021

Label scarcity in biomedicine: Data-rich latent factor discovery enhances phenotype prediction

High-quality data accumulation is now becoming ubiquitous in the health ...
research
10/31/2018

The Price of Fair PCA: One Extra Dimension

We investigate whether the standard dimensionality reduction technique o...
research
06/11/2018

Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models

This study investigates the use of non-linear unsupervised dimensionalit...
research
08/02/2021

Bucketed PCA Neural Networks with Neurons Mirroring Signals

The bucketed PCA neural network (PCA-NN) with transforms is developed he...

Please sign up or login with your details

Forgot password? Click here to reset