Gaussian Determinantal Processes: a new model for directionality in data

11/19/2021
by   Subhro Ghosh, et al.
0

Determinantal point processes (a.k.a. DPPs) have recently become popular tools for modeling the phenomenon of negative dependence, or repulsion, in data. However, our understanding of an analogue of a classical parametric statistical theory is rather limited for this class of models. In this work, we investigate a parametric family of Gaussian DPPs with a clearly interpretable effect of parametric modulation on the observed points. We show that parameter modulation impacts the observed points by introducing directionality in their repulsion structure, and the principal directions correspond to the directions of maximal (i.e. the most long ranged) dependency. This model readily yields a novel and viable alternative to Principal Component Analysis (PCA) as a dimension reduction tool that favors directions along which the data is most spread out. This methodological contribution is complemented by a statistical analysis of a spiked model similar to that employed for covariance matrices as a framework to study PCA. These theoretical investigations unveil intriguing questions for further examination in random matrix theory, stochastic geometry and related topics.

READ FULL TEXT
research
12/13/2021

Robust factored principal component analysis for matrix-valued outlier accommodation and detection

Principal component analysis (PCA) is a popular dimension reduction tech...
research
12/31/2018

The Stochastic Complexity of Principal Component Analysis

PCA (principal component analysis) and its variants are ubiquitous techn...
research
01/05/2018

Principal component analysis for big data

Big data is transforming our world, revolutionizing operations and analy...
research
10/04/2021

Row-clustering of a Point Process-valued Matrix

Structured point process data harvested from various platforms poses new...
research
01/01/2023

PCA-based Data Reduction and Signal Separation Techniques for James-Webb Space Telescope Data Processing

Principal Component Analysis (PCA)-based techniques can separate data in...
research
07/16/2014

Sequential Logistic Principal Component Analysis (SLPCA): Dimensional Reduction in Streaming Multivariate Binary-State System

Sequential or online dimensional reduction is of interests due to the ex...
research
12/09/2022

Transportation-Based Functional ANOVA and PCA for Covariance Operators

We consider the problem of comparing several samples of stochastic proce...

Please sign up or login with your details

Forgot password? Click here to reset