Randomized Dimension Reduction on Massive Data

11/07/2012
by Stoyan Georgiev, et al.

Scalability of statistical estimators is of increasing importance in modern applications, and dimension reduction is often used to extract relevant information from data. A variety of popular dimension reduction approaches can be framed as symmetric generalized eigendecomposition problems. In this paper we outline how taking into account the low-rank structure implicitly assumed by these dimension reduction approaches provides both computational and statistical advantages. We adapt recent randomized low-rank approximation algorithms to provide efficient solutions to three dimension reduction methods: Principal Component Analysis (PCA), Sliced Inverse Regression (SIR), and Localized Sliced Inverse Regression (LSIR). A key observation in this paper is that randomization serves a dual role, improving both computational and statistical performance. This point is highlighted in our experiments on real and simulated data.
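To make the idea of a randomized low-rank approximation concrete for the PCA case, here is a minimal sketch in Python with NumPy. It follows the generic randomized range-finder recipe (Gaussian test matrix, a few power iterations, then an exact SVD of a small projected matrix) and is not taken from the paper: the function name `randomized_pca`, the parameters `oversample` and `n_power_iter`, and the synthetic data are all assumptions made for illustration, and the authors' algorithms for PCA, SIR, and LSIR may differ in important details.

```python
import numpy as np

def randomized_pca(X, k, oversample=10, n_power_iter=2, seed=0):
    """Approximate the top-k principal components of X (rows are samples).

    Illustrative sketch only; names and defaults are hypothetical.
    """
    rng = np.random.default_rng(seed)
    Xc = X - X.mean(axis=0)                  # center each column
    n, p = Xc.shape
    ell = min(k + oversample, min(n, p))     # sketch size, slightly above k

    # Project onto a random Gaussian test matrix to capture the dominant range.
    Omega = rng.standard_normal((p, ell))
    Q, _ = np.linalg.qr(Xc @ Omega)

    # Power iterations sharpen the spectral gap; QR keeps them numerically stable.
    for _ in range(n_power_iter):
        Z, _ = np.linalg.qr(Xc.T @ Q)
        Q, _ = np.linalg.qr(Xc @ Z)

    # Exact SVD of the small (ell x p) projected matrix.
    B = Q.T @ Xc
    _, s, Vt = np.linalg.svd(B, full_matrices=False)

    components = Vt[:k]                      # rows are principal directions
    explained_variance = s[:k] ** 2 / (n - 1)
    return components, explained_variance

# Synthetic example: rank-3 signal in 50 dimensions plus small noise.
rng = np.random.default_rng(1)
latent = rng.standard_normal((500, 3)) * np.array([10.0, 5.0, 2.0])
loadings = np.linalg.qr(rng.standard_normal((50, 3)))[0]
X = latent @ loadings.T + 0.01 * rng.standard_normal((500, 50))

components, variances = randomized_pca(X, k=3)
```

The expensive steps touch the data only through products with thin matrices of width `k + oversample`, which is where the computational saving over a full eigendecomposition comes from when the data are close to low rank.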


research
04/13/2015

Adaptive Randomized Dimension Reduction on Massive Data

The scalability of statistical estimators is of increasing importance in...
research
04/23/2022

Spherical Rotation Dimension Reduction with Geometric Loss Functions

Modern datasets witness high-dimensionality and nontrivial geometries of...
research
10/27/2021

Poisson PCA for matrix count data

We develop a dimension reduction framework for data consisting of matric...
research
08/31/2018

A novel extension of Generalized Low-Rank Approximation of Matrices based on multiple-pairs of transformations

Dimension reduction is a main step in learning process which plays a ess...
research
08/17/2020

Principal Ellipsoid Analysis (PEA): Efficient non-linear dimension reduction clustering

Even with the rise in popularity of over-parameterized models, simple di...
research
06/20/2020

Weakly-correlated synapses promote dimension reduction in deep neural networks

By controlling synaptic and neural correlations, deep learning has achie...
research
09/15/2016

Recursive nearest agglomeration (ReNA): fast clustering for approximation of structured signals

In this work, we revisit fast dimension reduction approaches, as with r...
