A Local Similarity-Preserving Framework for Nonlinear Dimensionality Reduction with Neural Networks

03/10/2021
by   Xiang Wang, et al.
0

Real-world data usually have high dimensionality and it is important to mitigate the curse of dimensionality. High-dimensional data are usually in a coherent structure and make the data in relatively small true degrees of freedom. There are global and local dimensionality reduction methods to alleviate the problem. Most of existing methods for local dimensionality reduction obtain an embedding with the eigenvalue or singular value decomposition, where the computational complexities are very high for a large amount of data. Here we propose a novel local nonlinear approach named Vec2vec for general purpose dimensionality reduction, which generalizes recent advancements in embedding representation learning of words to dimensionality reduction of matrices. It obtains the nonlinear embedding using a neural network with only one hidden layer to reduce the computational complexity. To train the neural network, we build the neighborhood similarity graph of a matrix and define the context of data points by exploiting the random walk properties. Experiments demenstrate that Vec2vec is more efficient than several state-of-the-art local dimensionality reduction methods in a large number of high-dimensional data. Extensive experiments of data classification and clustering on eight real datasets show that Vec2vec is better than several classical dimensionality reduction methods in the statistical hypothesis test, and it is competitive with recently developed state-of-the-art UMAP.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/09/2020

Supervised Discriminative Sparse PCA with Adaptive Neighbors for Dimensionality Reduction

Dimensionality reduction is an important operation in information visual...
research
11/13/2012

Multi-Sensor Fusion via Reduction of Dimensionality

Large high-dimensional datasets are becoming more and more popular in an...
research
11/07/2018

SRP: Efficient class-aware embedding learning for large-scale data via supervised random projections

Supervised dimensionality reduction strategies have been of great intere...
research
01/30/2020

NCVis: Noise Contrastive Approach for Scalable Visualization

Modern methods for data visualization via dimensionality reduction, such...
research
06/10/2011

A Computational Framework for Nonlinear Dimensionality Reduction of Large Data Sets: The Exploratory Inspection Machine (XIM)

In this paper, we present a novel computational framework for nonlinear ...
research
06/27/2012

Regularizers versus Losses for Nonlinear Dimensionality Reduction: A Factored View with New Convex Relaxations

We demonstrate that almost all non-parametric dimensionality reduction m...

Please sign up or login with your details

Forgot password? Click here to reset