SQuadMDS: a lean Stochastic Quartet MDS improving global structure preservation in neighbor embedding like t-SNE and UMAP

02/24/2022
by Pierre Lambert, et al.

Multidimensional scaling (MDS) is a statistical process that embeds high-dimensional data into a lower-dimensional space, often for the purpose of data visualisation. Common multidimensional scaling algorithms tend to have high computational complexity, making them inapplicable to large data sets. This work introduces a stochastic, force-directed approach to multidimensional scaling with time and space complexity of O(N) for N data points. The method can be combined with force-directed layouts from the neighbour embedding family, such as t-SNE, to produce embeddings that preserve both the global and the local structure of the data. Experiments assess the quality of the embeddings produced by the standalone version and by its hybrid extension, both quantitatively and qualitatively, showing competitive results that outperform state-of-the-art approaches. Code is available at https://github.com/PierreLambert3/SQuaD-MDS-and-FItSNE-hybrid.
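
The abstract does not spell out the cost function, so the following is only a minimal sketch of how a stochastic, quartet-based MDS step could look, under stated assumptions: points are randomly partitioned into groups of four at each epoch, and each group is moved by gradient descent so that its six low-dimensional distances, normalised by their sum, approach the same normalised distances in the original space. The function names (`relative_distances`, `quartet_cost_and_grad`, `run_epoch`), the plain gradient step and the learning rate are illustrative choices, not the authors' implementation, which lives in the repository linked above.

```python
import math
import random

# Index pairs for the six pairwise distances inside a quartet of points.
PAIRS = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]


def relative_distances(points):
    """Return the six pairwise distances of a quartet and the same
    distances divided by their sum (assumed 'relative' distances)."""
    dists = [math.dist(points[i], points[j]) + 1e-12 for i, j in PAIRS]
    total = sum(dists)
    return dists, [d / total for d in dists]


def quartet_cost_and_grad(hd_quartet, ld_quartet):
    """Squared error between high- and low-dimensional relative distances
    of one quartet, with its gradient w.r.t. the low-dimensional points."""
    _, p = relative_distances(hd_quartet)        # target (high-dim) values
    l, r = relative_distances(ld_quartet)        # current (low-dim) values
    total = sum(l)
    resid = [pk - rk for pk, rk in zip(p, r)]
    cost = sum(e * e for e in resid)
    weighted = sum(e * rk for e, rk in zip(resid, r))
    dim = len(ld_quartet[0])
    grad = [[0.0] * dim for _ in range(4)]
    for m, (i, j) in enumerate(PAIRS):
        # Derivative of the cost w.r.t. the m-th low-dimensional distance.
        dc_dl = -2.0 / total * (resid[m] - weighted)
        for a in range(dim):
            unit = (ld_quartet[i][a] - ld_quartet[j][a]) / l[m]
            grad[i][a] += dc_dl * unit
            grad[j][a] -= dc_dl * unit
    return cost, grad


def run_epoch(X, Y, lr, rng):
    """One pass: shuffle the indices, cut them into disjoint quartets and
    take a gradient step on each. Y is updated in place. Returns the mean
    quartet cost measured before each step."""
    n = len(X)
    order = list(range(n))
    rng.shuffle(order)
    total_cost = 0.0
    for start in range(0, n - 3, 4):
        idx = order[start:start + 4]
        cost, grad = quartet_cost_and_grad([X[i] for i in idx],
                                           [Y[i] for i in idx])
        total_cost += cost
        for k, i in enumerate(idx):
            for a in range(len(Y[i])):
                Y[i][a] -= lr * grad[k][a]
    return total_cost / (n // 4)
```

In this sketch each epoch touches about N/4 quartets with constant work per quartet, and no N-by-N distance matrix is ever stored, which is consistent with the linear time and space cost stated in the abstract. Calling `run_epoch(X, Y, lr, random.Random(0))` repeatedly, with `X` and `Y` as lists of coordinate lists and at least four points, would drive the layout; a practical version would likely vectorise the loop and add momentum or a learning-rate schedule.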


