Dimension Reduction for Data with Heterogeneous Missingness

09/24/2021
by   Yurong Ling, et al.
0

Dimension reduction plays a pivotal role in analysing high-dimensional data. However, observations with missing values present serious difficulties in directly applying standard dimension reduction techniques. As a large number of dimension reduction approaches are based on the Gram matrix, we first investigate the effects of missingness on dimension reduction by studying the statistical properties of the Gram matrix with or without missingness, and then we present a bias-corrected Gram matrix with nice statistical properties under heterogeneous missingness. Extensive empirical results, on both simulated and publicly available real datasets, show that the proposed unbiased Gram matrix can significantly improve a broad spectrum of representative dimension reduction approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/22/2019

Dimension reduction in spatial regression with kernel SAVE method

We consider the smoothed version of sliced average variance estimation (...
research
11/02/2010

A Very Fast Algorithm for Matrix Factorization

We present a very fast algorithm for general matrix factorization of a d...
research
05/26/2015

Using Dimension Reduction to Improve the Classification of High-dimensional Data

In this work we show that the classification performance of high-dimensi...
research
08/22/2023

Generalized dimension reduction approach for heterogeneous networked systems with time-delay

Networks of interconnected agents are essential to study complex network...
research
09/09/2023

Non-linear dimension reduction in factor-augmented vector autoregressions

This paper introduces non-linear dimension reduction in factor-augmented...
research
11/30/2015

Universality laws for randomized dimension reduction, with applications

Dimension reduction is the process of embedding high-dimensional data in...
research
02/16/2022

Using the left Gram matrix to cluster high dimensional data

For high dimensional data, where P features for N objects (P >> N) are r...

Please sign up or login with your details

Forgot password? Click here to reset