A Computational Framework for Nonlinear Dimensionality Reduction of Large Data Sets: The Exploratory Inspection Machine (XIM)

06/10/2011
by   Axel Wismüller, et al.
0

In this paper, we present a novel computational framework for nonlinear dimensionality reduction which is specifically suited to process large data sets: the Exploratory Inspection Machine (XIM). XIM introduces a conceptual cross-link between hitherto separate domains of machine learning, namely topographic vector quantization and divergence-based neighbor embedding approaches. There are three ways to conceptualize XIM, namely (i) as the inversion of the Exploratory Observation Machine (XOM) and its variants, such as Neighbor Embedding XOM (NE-XOM), (ii) as a powerful optimization scheme for divergence-based neighbor embedding cost functions inspired by Stochastic Neighbor Embedding (SNE) and its variants, such as t-distributed SNE (t-SNE), and (iii) as an extension of topographic vector quantization methods, such as the Self-Organizing Map (SOM). By preserving both global and local data structure, XIM combines the virtues of classical and advanced recent embedding methods. It permits direct visualization of large data collections without the need for prior data reduction. Finally, XIM can contribute to many application domains of data analysis and visualization important throughout the sciences and engineering, such as pattern matching, constrained incremental learning, data clustering, and the analysis of non-metric dissimilarity data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/10/2021

A Local Similarity-Preserving Framework for Nonlinear Dimensionality Reduction with Neural Networks

Real-world data usually have high dimensionality and it is important to ...
research
10/06/2021

T-SNE Is Not Optimized to Reveal Clusters in Data

Cluster visualization is an essential task for nonlinear dimensionality ...
research
08/29/2023

Tuning the perplexity for and computing sampling-based t-SNE embeddings

Widely used pipelines for the analysis of high-dimensional data utilize ...
research
11/30/2021

Towards a comprehensive visualization of structure in data

Dimensional data reduction methods are fundamental to explore and visual...
research
02/09/2017

Stochastic Neighbor Embedding separates well-separated clusters

Stochastic Neighbor Embedding and its variants are widely used dimension...
research
10/27/2018

Monitoring the shape of weather, soundscapes, and dynamical systems: a new statistic for dimension-driven data analysis on large data sets

Dimensionality-reduction methods are a fundamental tool in the analysis ...
research
09/07/2022

Dimensionality Reduction using Elastic Measures

With the recent surge in big data analytics for hyper-dimensional data t...

Please sign up or login with your details

Forgot password? Click here to reset