Function Preserving Projection for Scalable Exploration of High-Dimensional Data

09/25/2019
by   Shusen Liu, et al.
3

We present function preserving projections (FPP), a scalable linear projection technique for discovering interpretable relationships in high-dimensional data. Conventional dimension reduction methods aim to maximally preserve the global and/or local geometric structure of a dataset. However, in practice one is often more interested in determining how one or multiple user-selected response function(s) can be explained by the data. To intuitively connect the responses to the data, FPP constructs 2D linear embeddings optimized to reveal interpretable yet potentially non-linear patterns of the response functions. More specifically, FPP is designed to (i) produce human-interpretable embeddings; (ii) capture non-linear relationships; (iii) allow the simultaneous use of multiple response functions; and (iv) scale to millions of samples. Using FPP on real-world datasets, one can obtain fundamentally new insights about high-dimensional relationships in large-scale data that could not be achieved using existing dimension reduction methods.

READ FULL TEXT

page 5

page 7

page 8

research
05/20/2023

Contrastive inverse regression for dimension reduction

Supervised dimension reduction (SDR) has been a topic of growing interes...
research
03/11/2021

Modern Dimension Reduction

Data are not only ubiquitous in society, but are increasingly complex bo...
research
12/11/2020

Casting Multiple Shadows: High-Dimensional Interactive Data Visualisation with Tours and Embeddings

Non-linear dimensionality reduction (NLDR) methods such as t-distributed...
research
12/19/2017

Exploring High-Dimensional Structure via Axis-Aligned Decomposition of Linear Projections

Two-dimensional embeddings remain the dominant approach to visualize hig...
research
05/25/2018

On the Estimation of Entropy in the FastICA Algorithm

The fastICA algorithm is a popular dimension reduction technique used to...
research
08/30/2019

Fast and Accurate Network Embeddings via Very Sparse Random Projection

We present FastRP, a scalable and performant algorithm for learning dist...
research
09/09/2023

Non-linear dimension reduction in factor-augmented vector autoregressions

This paper introduces non-linear dimension reduction in factor-augmented...

Please sign up or login with your details

Forgot password? Click here to reset