Supervising Embedding Algorithms Using the Stress

07/14/2022
by   Ery Arias-Castro, et al.
0

While classical scaling, just like principal component analysis, is parameter-free, most other methods for embedding multivariate data require the selection of one or several parameters. This tuning can be difficult due to the unsupervised nature of the situation. We propose a simple, almost obvious, approach to supervise the choice of tuning parameter(s): minimize a notion of stress. We substantiate this choice by reference to rigidity theory. We extend a result by Aspnes et al. (IEEE Mobile Computing, 2006), showing that general random geometric graphs are trilateration graphs with high probability. And we provide a stability result à la Anderson et al. (SIAM Discrete Mathematics, 2010). We illustrate this approach in the context of the MDS-MAP(P) algorithm of Shang and Ruml (IEEE INFOCOM, 2004). As a prototypical patch-stitching method, it requires the choice of patch size, and we use the stress to make that choice data-driven. In this context, we perform a number of experiments to illustrate the validity of using the stress as the basis for tuning parameter selection. In so doing, we uncover a bias-variance tradeoff, which is a phenomenon which may have been overlooked in the multidimensional scaling literature. By turning MDS-MAP(P) into a method for manifold learning, we obtain a local version of Isomap for which the minimization of the stress may also be used for parameter tuning.

READ FULL TEXT

page 14

page 15

page 22

research
09/14/2017

Generalized Biplots for Multidimensional Scaled Projections

Dimension reduction and visualization is a staple of data analytics. Met...
research
09/17/2020

Multidimensional Scaling, Sammon Mapping, and Isomap: Tutorial and Survey

Multidimensional Scaling (MDS) is one of the first fundamental manifold ...
research
12/21/2016

Stochastic Multidimensional Scaling

Multidimensional scaling (MDS) is a popular dimensionality reduction tec...
research
10/12/2017

Graph Drawing by Weighted Constraint Relaxation

A popular method of force-directed graph drawing is multidimensional sca...
research
08/31/2014

Persistent Homology in Sparse Regression and Its Application to Brain Morphometry

Sparse systems are usually parameterized by a tuning parameter that dete...
research
04/10/2018

Representation Tradeoffs for Hyperbolic Embeddings

Hyperbolic embeddings offer excellent quality with few dimensions when e...
research
10/07/2019

Role of local mode mixity and stress triaxiality in the fracture of niobium/alumina bi-crystal interfaces _ a CPFEM based study

Local mode mixity and stress triaxiality plays an important role during ...

Please sign up or login with your details

Forgot password? Click here to reset