A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines

01/04/2018
by David Charte, et al.

Many existing machine learning algorithms, both supervised and unsupervised, depend on the quality of the input features to generate a good model. The number of these variables is also important, since performance tends to decline as the input dimensionality increases; hence the interest in feature fusion techniques, which produce feature sets that are more compact and higher level. A plethora of procedures for fusing original variables into new ones has been developed in the past decades. The most basic ones use linear combinations of the original variables, such as PCA (Principal Component Analysis) and LDA (Linear Discriminant Analysis), while others find manifold embeddings of lower dimensionality based on nonlinear combinations, such as Isomap or LLE (Locally Linear Embedding). More recently, autoencoders (AEs) have emerged as an alternative to manifold learning for conducting nonlinear feature fusion. Dozens of AE models have been proposed lately, each with its own specific traits. Although many of them can be used to generate reduced feature sets through the fusion of the original ones, there are also AEs designed with other applications in mind. The goal of this paper is to provide the reader with a broad view of what an AE is, how AEs are used for feature fusion, a taxonomy gathering a broad range of models, and how they relate to other classical techniques. In addition, a set of didactic guidelines on how to choose the proper AE for a given task is supplied, together with a discussion of the software tools available. Finally, two case studies illustrate the usage of AEs with datasets of handwritten digits and breast cancer.
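To make the idea of nonlinear feature fusion concrete, the following is a minimal sketch of a one-hidden-layer autoencoder in plain Python. It is not taken from the paper: the toy dataset, the function name `train_autoencoder`, the tanh activation, and the hyperparameters are all assumptions chosen for illustration. Four input features are fused into two encoded features, and training minimizes the squared reconstruction error.

```python
import math
import random


def train_autoencoder(data, n_hidden, epochs=300, lr=0.02, seed=0):
    """Train a tiny autoencoder with per-sample gradient descent.

    Hypothetical helper for illustration; returns (encode, losses).
    """
    rng = random.Random(seed)
    n_in = len(data[0])
    # Encoder weights W1 (n_hidden x n_in) and decoder weights W2 (n_in x n_hidden).
    W1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
    b1 = [0.0] * n_hidden
    W2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_in)]
    b2 = [0.0] * n_in

    def encode(x):
        # Nonlinear fusion: each encoded feature is tanh of a weighted sum of inputs.
        return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
                for row, b in zip(W1, b1)]

    def decode(h):
        # Linear reconstruction of the original features from the code.
        return [sum(w * hi for w, hi in zip(row, h)) + b
                for row, b in zip(W2, b2)]

    losses = []
    for _ in range(epochs):
        total = 0.0
        for x in data:
            h = encode(x)
            y = decode(h)
            err = [yi - xi for yi, xi in zip(y, x)]  # gradient of 0.5 * ||y - x||^2
            total += sum(e * e for e in err)
            # Backpropagate to the hidden layer before the decoder is updated.
            grad_h = [sum(err[i] * W2[i][j] for i in range(n_in)) * (1.0 - h[j] ** 2)
                      for j in range(n_hidden)]
            for i in range(n_in):
                for j in range(n_hidden):
                    W2[i][j] -= lr * err[i] * h[j]
                b2[i] -= lr * err[i]
            for j in range(n_hidden):
                for k in range(n_in):
                    W1[j][k] -= lr * grad_h[j] * x[k]
                b1[j] -= lr * grad_h[j]
        losses.append(total / len(data))  # mean squared reconstruction error per sample
    return encode, losses


# Toy data (an assumption): four features driven by two hidden factors a and b.
data_rng = random.Random(1)
data = []
for _ in range(100):
    a, b = data_rng.uniform(-1, 1), data_rng.uniform(-1, 1)
    data.append([a, b, a * b, 0.5 * (a - b)])

encode, losses = train_autoencoder(data, n_hidden=2)
codes = [encode(x) for x in data]  # the fused, lower-dimensional feature set
print(f"first-epoch loss: {losses[0]:.4f}, last-epoch loss: {losses[-1]:.4f}")
```

The list `codes` plays the role of the reduced feature set that a downstream model would consume. A linear method such as PCA would be restricted to weighted sums of the inputs, whereas the tanh encoder here can combine them nonlinearly.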

research
08/20/2011

Multisensor Images Fusion Based on Feature-Level

Until now, of highest relevance for remote sensing data processing and a...
research
05/21/2020

An analysis on the use of autoencoders for representation learning: fundamentals, learning task case studies, explainability and challenges

In many machine learning tasks, learning a good representation of the da...
research
01/31/2017

Computational Techniques in Multispectral Image Processing: Application to the Syriac Galen Palimpsest

Multispectral and hyperspectral image analysis has experienced much deve...
research
05/04/2021

Ovarian Cancer Detection based on Dimensionality Reduction Techniques and Genetic Algorithm

In this research, we have two serum SELDI (surface-enhanced laser desorp...
research
12/02/2015

Optimal whitening and decorrelation

Whitening, or sphering, is a common preprocessing step in statistical an...
research
05/30/2015

A Review of Feature and Data Fusion with Medical Images

The fusion techniques that utilize multiple feature sets to form new fea...
research
11/01/2019

High-dimensional Nonlinear Profile Monitoring based on Deep Probabilistic Autoencoders

Wide accessibility of imaging and profile sensors in modern industrial s...
