Multidimensional Scaling of Noisy High Dimensional Data

01/30/2018
by   Erez Peterfreund, et al.
0

Multidimensional Scaling (MDS) is a classical technique for embedding data in low dimensions, still in widespread use today. Originally introduced in the 1950's, MDS was not designed with high-dimensional data in mind; while it remains popular with data analysis practitioners, no doubt it should be adapted to the high-dimensional data regime. In this paper we study MDS under modern setting, and specifically, high dimensions and ambient measurement noise. We show that, as the ambient noise level increase, MDS suffers a sharp breakdown that depends on the data dimension and noise level, and derive an explicit formula for this breakdown point in the case of white noise. We then introduce MDS+, an extremely simple variant of MDS, which applies a carefully derived shrinkage nonlinearity to the eigenvalues of the MDS similarity matrix. Under a loss function measuring the embedding quality, MDS+ is the unique asymptotically optimal shrinkage function. We prove that MDS+ offers improved embedding, sometimes significantly so, compared with classical MDS. Furthermore, MDS+ does not require external estimates of the embedding dimension (a famous difficulty in classical MDS), as it calculates the optimal dimension into which the data should be embedded.

READ FULL TEXT
research
09/17/2020

Multidimensional Scaling, Sammon Mapping, and Isomap: Tutorial and Survey

Multidimensional Scaling (MDS) is one of the first fundamental manifold ...
research
09/23/2021

Multidimensional Scaling: Approximation and Complexity

Metric Multidimensional scaling (MDS) is a classical method for generati...
research
09/14/2022

Cluster-based multidimensional scaling embedding tool for data visualization

We present a new technique for visualizing high-dimensional data called ...
research
12/29/2021

The Classical Multidimensional Scaling Revisited

We reexamine the the classical multidimensional scaling (MDS). We study ...
research
06/01/2018

Pattern Search Multidimensional Scaling

We present a novel view of nonlinear manifold learning using derivative-...
research
10/24/2018

Modified Multidimensional Scaling and High Dimensional Clustering

Multidimensional scaling is an important dimension reduction tool in sta...
research
06/01/2018

Pattern Search MDS

We present a novel view of nonlinear manifold learning using derivative-...

Please sign up or login with your details

Forgot password? Click here to reset