Topological Stability: Guided Determination of the Nearest Neighbors in Non-Linear Dimensionality Reduction Techniques

11/13/2019
by   Mohammed Elhenawy, et al.
39

In machine learning field, dimensionality reduction is one of the important tasks. It mitigates the undesired properties of high-dimensional spaces to facilitate classification, compression, and visualization of high-dimensional data. During the last decade, researchers proposed a large number of new (nonlinear) techniques for dimensionality reduction. Most of these techniques are based on the intuition that data lies on or near a complex low-dimensional manifold that is embedded in the high-dimensional space. New techniques for dimensionality reduction aim at identifying and extracting the manifold from the high-dimensional space. Isomap is one of widely-used low-dimensional embedding methods, where geodesic distances on a weighted graph are incorporated with the classical scaling (metric multidimensional scaling). Isomap chooses the nearest neighbors based on the distance only which causes bridges and topological instability. In this paper we pay our attention to topological stability that was not considered in Isomap.because at any point on the manifold , that point and its nearest neighbors forms a vector subspace and the orthogonal to that subspace is orthogonal to all vectors spans the vector subspace. Our approach uses the point itself and its two nearest neighbors to find the bases of the subspace and the orthogonal to that subspace which belongs to the orthogonal complementary subspace. Our approach then adds new points to the two nearest neighbors based on the distance and the angle between each new point and the orthogonal to the subspace. The superior performance of the new approach in choosing the nearest neighbors is confirmed through experimental work with several datasets.

READ FULL TEXT

page 6

page 7

page 8

page 9

page 10

page 11

page 12

page 14

research
11/13/2019

Topological Stability: a New Algorithm for Selecting The Nearest Neighbors in Non-Linear Dimensionality Reduction Techniques

In the machine learning field, dimensionality reduction is an important ...
research
08/12/2021

Clustering with UMAP: Why and How Connectivity Matters

Topology based dimensionality reduction methods such as t-SNE and UMAP h...
research
09/09/2017

Simultaneously Learning Neighborship and Projection Matrix for Supervised Dimensionality Reduction

Explicitly or implicitly, most of dimensionality reduction methods need ...
research
12/06/2018

SqueezeFit: Label-aware dimensionality reduction by semidefinite programming

Given labeled points in a high-dimensional vector space, we seek a low-d...
research
02/23/2018

AEkNN: An AutoEncoder kNN-based classifier with built-in dimensionality reduction

High dimensionality, i.e. data having a large number of variables, tends...
research
09/22/2021

The Curse Revisited: a Newly Quantified Concept of Meaningful Distances for Learning from High-Dimensional Noisy Data

Distances between data points are widely used in point cloud representat...
research
01/11/2023

Towards Microstructural State Variables in Materials Systems

The vast combination of material properties seen in nature are achieved ...

Please sign up or login with your details

Forgot password? Click here to reset