A Probabilistic Graph Coupling View of Dimension Reduction

01/31/2022
βˆ™
by   Hugues van Assel, et al.
βˆ™
0
βˆ™

Most popular dimension reduction (DR) methods like t-SNE and UMAP are based on minimizing a cost between input and latent pairwise similarities. Though widely used, these approaches lack clear probabilistic foundations to enable a full understanding of their properties and limitations. To that extent, we introduce a unifying statistical framework based on the coupling of hidden graphs using cross entropy. These graphs induce a Markov random field dependency structure among the observations in both input and latent spaces. We show that existing pairwise similarity DR methods can be retrieved from our framework with particular choices of priors for the graphs. Moreover this reveals that these methods suffer from a statistical deficiency that explains poor performances in conserving coarse-grain dependencies. Our model is leveraged and extended to address this issue while new links are drawn with Laplacian eigenmaps and PCA.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
βˆ™ 12/08/2020

Understanding How Dimension Reduction Tools Work: An Empirical Approach to Deciphering t-SNE, UMAP, TriMAP, and PaCMAP for Data Visualization

Dimension reduction (DR) techniques such as t-SNE, UMAP, and TriMAP have...
research
βˆ™ 05/22/2020

Rdimtools: An R package for Dimension Reduction and Intrinsic Dimension Estimation

Discovering patterns of the complex high-dimensional data is a long-stan...
research
βˆ™ 08/04/2017

A Latent Variable Model for Two-Dimensional Canonical Correlation Analysis and its Variational Inference

Describing the dimension reduction (DR) techniques by means of probabili...
research
βˆ™ 06/19/2019

Bayesian inverse regression for supervised dimension reduction with small datasets

We consider supervised dimension reduction problems, namely to identify ...
research
βˆ™ 05/23/2023

SNEkhorn: Dimension Reduction with Symmetric Entropic Affinities

Many approaches in machine learning rely on a weighted graph to encode t...
research
βˆ™ 09/24/2019

Dimension Estimation Using Autoencoders

Dimension Estimation (DE) and Dimension Reduction (DR) are two closely r...

Please sign up or login with your details

Forgot password? Click here to reset