Factoring out prior knowledge from low-dimensional embeddings

03/02/2021
by Edith Heiter, et al.

Low-dimensional embedding techniques such as tSNE and UMAP allow visualizing high-dimensional data and thereby facilitate the discovery of interesting structure. Although they are widely used, they visualize data as is, rather than in light of the background knowledge we have about the data. What we already know, however, strongly determines what is novel and hence interesting. In this paper we propose two methods for factoring out prior knowledge, given in the form of distance matrices, from low-dimensional embeddings. To factor out prior knowledge from tSNE embeddings, we propose JEDI, which adapts the tSNE objective in a principled way using the Jensen-Shannon divergence. To factor out prior knowledge from any downstream embedding approach, we propose CONFETTI, which operates directly on the input distance matrices. Extensive experiments on both synthetic and real-world data show that both methods work well, providing embeddings that exhibit meaningful structure that would otherwise remain hidden.
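For readers unfamiliar with the Jensen-Shannon divergence mentioned above, the following is a minimal sketch of its standard textbook definition for two discrete distributions. It is not the JEDI objective, which the abstract does not spell out; the function names `kl_divergence` and `js_divergence` are illustrative choices, not names from the paper.

```python
import numpy as np


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in nats.

    Entries where p is zero contribute nothing (0 * log 0 is taken as 0).
    Assumes q > 0 wherever p > 0.
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))


def js_divergence(p, q):
    """Jensen-Shannon divergence between two discrete distributions.

    JS(p, q) = 0.5 * KL(p || m) + 0.5 * KL(q || m), with m = (p + q) / 2.
    Inputs are normalized to sum to one before the computation.
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    p = p / p.sum()
    q = q / q.sum()
    m = 0.5 * (p + q)  # mixture; positive wherever p or q is positive
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)
```

Unlike the Kullback-Leibler divergence used in the original tSNE objective, this quantity is symmetric in its arguments and bounded above by log 2 (in nats), which is reached when the two distributions have disjoint support.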


research
05/24/2019

Conditional t-SNE: Complementary t-SNE embeddings through factoring out prior information

Dimensionality reduction and manifold learning methods such as t-Distrib...
research
10/18/2021

Topologically Regularized Data Embeddings

Unsupervised feature learning often finds low-dimensional embeddings tha...
research
01/31/2023

Preserving local densities in low-dimensional embeddings

Low-dimensional embeddings and visualizations are an indispensable tool ...
research
09/03/2015

Encoding Prior Knowledge with Eigenword Embeddings

Canonical correlation analysis (CCA) is a method for reducing the dimens...
research
06/27/2020

Local Causal Structure Learning and its Discovery Between Type 2 Diabetes and Bone Mineral Density

Type 2 diabetes (T2DM), one of the most prevalent chronic diseases, affe...
research
09/27/2011

Generative Prior Knowledge for Discriminative Classification

We present a novel framework for integrating prior knowledge into discri...
research
05/21/2021

Elliptical Ordinal Embedding

Ordinal embedding aims at finding a low dimensional representation of ob...
