Kernel PCA for multivariate extremes

11/23/2022
by Marco Avella-Medina, et al.

We propose kernel PCA as a method for analyzing the dependence structure of multivariate extremes and demonstrate that it can be a powerful tool for clustering and dimension reduction. Our work provides some theoretical insight into the preimages obtained by kernel PCA, demonstrating that under certain conditions they can effectively identify clusters in the data. We build on these new insights to characterize rigorously the performance of kernel PCA based on an extremal sample, i.e., the angular part of random vectors for which the radius exceeds a large threshold. More specifically, we focus on the asymptotic dependence of multivariate extremes characterized by the angular or spectral measure in extreme value theory and provide a careful analysis in the case where the extremes are generated from a linear factor model. We give theoretical guarantees on the performance of kernel PCA preimages of such extremes by leveraging their asymptotic distribution together with Davis-Kahan perturbation bounds. Our theoretical findings are complemented with numerical experiments illustrating the finite sample performance of our methods.
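To make the pipeline in the abstract concrete, here is a minimal sketch, not the authors' implementation. It builds an extremal sample (the angular parts of observations whose radius exceeds a large threshold) and computes kernel PCA scores from it. The Euclidean norm, the 0.95 empirical quantile as the threshold, the Gaussian kernel with bandwidth parameter `gamma`, the toy loading matrix, and the function names `extremal_angles` and `kernel_pca_scores` are all illustrative assumptions. The sketch stops at the scores and does not compute preimages.

```python
import numpy as np


def extremal_angles(X, quantile=0.95):
    """Return the angular parts X_i / ||X_i|| of rows whose radius exceeds a threshold.

    Assumption: Euclidean norm, threshold set to an empirical quantile of the radii.
    """
    radii = np.linalg.norm(X, axis=1)
    threshold = np.quantile(radii, quantile)
    keep = radii > threshold
    return X[keep] / radii[keep, None]


def kernel_pca_scores(A, n_components=2, gamma=1.0):
    """Kernel PCA scores from a centered Gaussian kernel matrix (assumed kernel choice)."""
    sq_norms = np.sum(A ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * A @ A.T
    K = np.exp(-gamma * np.maximum(sq_dists, 0.0))

    # Center the kernel matrix in feature space: Kc = H K H with H = I - 11^T / n.
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n
    Kc = H @ K @ H

    # eigh returns eigenvalues in ascending order; keep the largest ones.
    eigvals, eigvecs = np.linalg.eigh(Kc)
    order = np.argsort(eigvals)[::-1][:n_components]
    lam = np.maximum(eigvals[order], 0.0)
    return eigvecs[:, order] * np.sqrt(lam)


# Toy linear factor model X = Z A^T with heavy-tailed (Pareto) factors.
# The loading matrix below is hypothetical and chosen only for illustration.
rng = np.random.default_rng(0)
loadings = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])
Z = rng.pareto(2.0, size=(5000, 2)) + 1.0
X = Z @ loadings.T

angles = extremal_angles(X, quantile=0.95)
scores = kernel_pca_scores(angles, n_components=2, gamma=5.0)
```

In a factor model of this kind, extreme observations tend to be driven by a single large factor, so the extremal angles concentrate near the normalized columns of the loading matrix. A clustering step on `scores` would be one way to look for those groups; the choice of clustering method is left open here.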


research
11/15/2021

Spectral learning of multivariate extremes

We propose a spectral clustering algorithm for analyzing the dependence ...
research
12/18/2020

Upper and Lower Bounds on the Performance of Kernel PCA

Principal Component Analysis (PCA) is a popular method for dimension red...
research
09/12/2021

Kernel PCA with the Nyström method

Kernel methods are powerful but computationally demanding techniques for...
research
01/05/2018

Principal component analysis for big data

Big data is transforming our world, revolutionizing operations and analy...
research
01/17/2022

Statistical Inference on a Changing Extremal Dependence Structure

We analyze the extreme value dependence of independent, not necessarily ...
research
08/17/2020

Principal Ellipsoid Analysis (PEA): Efficient non-linear dimension reduction clustering

Even with the rise in popularity of over-parameterized models, simple di...
research
08/13/2020

Informative Clusters for Multivariate Extremes

Capturing the dependence structure of multivariate extreme data is a maj...
