Modern Dimension Reduction

03/11/2021
by   Philip D. Waggoner, et al.
22

Data are not only ubiquitous in society, but are increasingly complex both in size and dimensionality. Dimension reduction offers researchers and scholars the ability to make such complex, high dimensional data spaces simpler and more manageable. This Element offers readers a suite of modern unsupervised dimension reduction techniques along with hundreds of lines of R code, to efficiently represent the original high dimensional data space in a simplified, lower dimensional subspace. Launching from the earliest dimension reduction technique principal components analysis and using real social science data, I introduce and walk readers through application of the following techniques: locally linear embedding, t-distributed stochastic neighbor embedding (t-SNE), uniform manifold approximation and projection, self-organizing maps, and deep autoencoders. The result is a well-stocked toolbox of unsupervised algorithms for tackling the complexities of high dimensional data so common in modern society. All code is publicly accessible on Github.

READ FULL TEXT

page 24

page 25

page 26

page 34

research
11/30/2015

Universality laws for randomized dimension reduction, with applications

Dimension reduction is the process of embedding high-dimensional data in...
research
05/22/2020

Rdimtools: An R package for Dimension Reduction and Intrinsic Dimension Estimation

Discovering patterns of the complex high-dimensional data is a long-stan...
research
06/17/2023

Linearly-scalable learning of smooth low-dimensional patterns with permutation-aided entropic dimension reduction

In many data science applications, the objective is to extract appropria...
research
05/26/2015

Using Dimension Reduction to Improve the Classification of High-dimensional Data

In this work we show that the classification performance of high-dimensi...
research
09/25/2019

Function Preserving Projection for Scalable Exploration of High-Dimensional Data

We present function preserving projections (FPP), a scalable linear proj...
research
05/31/2022

AVIDA: Alternating method for Visualizing and Integrating Data

High-dimensional multimodal data arises in many scientific fields. The i...
research
10/17/2021

Persuasion by Dimension Reduction

How should an agent (the sender) observing multi-dimensional data (the s...

Please sign up or login with your details

Forgot password? Click here to reset