Recovery of Linear Components: Reduced Complexity Autoencoder Designs

12/14/2020
by   Federico Zocco, et al.
0

Reducing dimensionality is a key preprocessing step in many data analysis applications to address the negative effects of the curse of dimensionality and collinearity on model performance and computational complexity, to denoise the data or to reduce storage requirements. Moreover, in many applications it is desirable to reduce the input dimensions by choosing a subset of variables that best represents the entire set without any a priori information available. Unsupervised variable selection techniques provide a solution to this second problem. An autoencoder, if properly regularized, can solve both unsupervised dimensionality reduction and variable selection, but the training of large neural networks can be prohibitive in time sensitive applications. We present an approach called Recovery of Linear Components (RLC), which serves as a middle ground between linear and non-linear dimensionality reduction techniques, reducing autoencoder training times while enhancing performance over purely linear techniques. With the aid of synthetic and real world case studies, we show that the RLC, when compared with an autoencoder of similar complexity, shows higher accuracy, similar robustness to overfitting, and faster training times. Additionally, at the cost of a relatively small increase in computational complexity, RLC is shown to outperform the current state-of-the-art for a semiconductor manufacturing wafer measurement site optimization application.

READ FULL TEXT
research
03/03/2021

Greedy Search Algorithms for Unsupervised Variable Selection: A Comparative Study

Dimensionality reduction is a important step in the development of scala...
research
06/07/2018

Feature selection in functional data classification with recursive maxima hunting

Dimensionality reduction is one of the key issues in the design of effec...
research
04/19/2018

Randomized ICA and LDA Dimensionality Reduction Methods for Hyperspectral Image Classification

Dimensionality reduction is an important step in processing the hyperspe...
research
06/03/2021

A Subspace-based Approach for Dimensionality Reduction and Important Variable Selection

An analysis of high dimensional data can offer a detailed description of...
research
06/19/2023

Nonlinear Feature Aggregation: Two Algorithms driven by Theory

Many real-world machine learning applications are characterized by a hug...
research
11/23/2016

Adaptive Down-Sampling and Dimension Reduction in Time Elastic Kernel Machines for Efficient Recognition of Isolated Gestures

In the scope of gestural action recognition, the size of the feature vec...
research
10/07/2020

Less is more: Faster and better music version identification with embedding distillation

Version identification systems aim to detect different renditions of the...

Please sign up or login with your details

Forgot password? Click here to reset