Nonlinear Dimensionality Reduction for Discriminative Analytics of Multiple Datasets

05/15/2018
by Jia Chen, et al.

Principal component analysis (PCA) is widely used for feature extraction and dimensionality reduction, with documented merits in diverse tasks involving high-dimensional data. Standard PCA handles one dataset at a time and is not designed to analyze multiple datasets jointly. In certain data science settings, however, one is interested in extracting the most discriminative information from one dataset of particular interest (a.k.a. target data) relative to the other(s) (a.k.a. background data). To this end, this paper puts forth a novel approach, termed discriminative (d) PCA, for such discriminative analytics of multiple datasets. Under certain conditions, dPCA is proved to be least-squares optimal in recovering the component vector unique to the target data relative to background data. To account for nonlinear data correlations, (linear) dPCA models for one or multiple background datasets are generalized through kernel-based learning. Interestingly, all dPCA variants admit an analytical solution obtainable with a single (generalized) eigenvalue decomposition. Finally, dimensionality reduction tests using both synthetic and real datasets corroborate the effectiveness of the proposed methods.
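
To make the "single (generalized) eigenvalue decomposition" concrete, here is a minimal sketch of how a linear dPCA direction could be computed with one background dataset. The abstract does not state the objective, so the sketch assumes the ratio form, maximizing u^T C_xx u / u^T C_yy u, where C_xx and C_yy are the sample covariances of the target and background data. The function name `dpca`, the `ridge` parameter, and the Cholesky-whitening route are illustrative choices, not taken from the paper.

```python
import numpy as np

def dpca(target, background, k=1, ridge=1e-6):
    """Sketch of linear dPCA under the assumed ratio objective.

    target:     (m, D) array, one sample per row
    background: (n, D) array, one sample per row
    Returns a (D, k) array whose unit-norm columns are the top-k directions.
    """
    # Center each dataset and form its sample covariance matrix.
    xc = target - target.mean(axis=0)
    yc = background - background.mean(axis=0)
    c_xx = xc.T @ xc / xc.shape[0]
    c_yy = yc.T @ yc / yc.shape[0]

    # Assumption: a small ridge keeps the background covariance invertible.
    c_yy = c_yy + ridge * np.eye(c_yy.shape[0])

    # Solve C_xx u = lambda * C_yy u by whitening with C_yy = L @ L.T.
    chol = np.linalg.cholesky(c_yy)
    tmp = np.linalg.solve(chol, c_xx)              # L^{-1} C_xx
    whitened = np.linalg.solve(chol, tmp.T).T      # L^{-1} C_xx L^{-T}
    whitened = (whitened + whitened.T) / 2.0       # remove round-off asymmetry

    # eigh returns eigenvalues in ascending order; keep the k largest.
    _, eigvecs = np.linalg.eigh(whitened)
    top = eigvecs[:, ::-1][:, :k]

    # Map back to the original coordinates: u = L^{-T} v, then normalize.
    u = np.linalg.solve(chol.T, top)
    return u / np.linalg.norm(u, axis=0)


# Synthetic check: both datasets share a dominant direction (axis 0),
# but only the target has extra variance along axis 2.
rng = np.random.default_rng(0)
background = rng.normal(size=(2000, 5)) * np.array([5.0, 1.0, 1.0, 1.0, 1.0])
target = rng.normal(size=(2000, 5)) * np.array([5.0, 1.0, 3.0, 1.0, 1.0])
U = dpca(target, background, k=1)
print(np.round(np.abs(U[:, 0]), 2))
```

On this toy data, standard PCA on the target alone would favor axis 0, since that is where the variance is largest, whereas the ratio objective discounts variance that the background shares and should weight axis 2 most heavily.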


research
10/25/2017

DPCA: Dimensionality Reduction for Discriminative Analytics of Multiple Large-Scale Datasets

Principal component analysis (PCA) has well-documented merits for data e...
research
01/09/2020

Supervised Discriminative Sparse PCA with Adaptive Neighbors for Dimensionality Reduction

Dimensionality reduction is an important operation in information visual...
research
01/29/2018

Nonlinear Dimensionality Reduction on Graphs

In this era of data deluge, many signal processing and machine learning ...
research
11/14/2018

Unsupervised learning with contrastive latent variable models

In unsupervised learning, dimensionality reduction is an important tool ...
research
08/01/2017

DROP: Dimensionality Reduction Optimization for Time Series

Dimensionality reduction is critical in analyzing increasingly high-volu...
research
01/31/2016

Principal Polynomial Analysis

This paper presents a new framework for manifold learning based on a seq...
research
01/22/2019

Efficient Image Splicing Localization via Contrastive Feature Extraction

In this work, we propose a new data visualization and clustering techniq...
