Poisson PCA: Poisson Measurement Error corrected PCA, with Application to Microbiome Data

04/26/2019
by Toby Kenney, et al.

In this paper, we study the problem of computing a Principal Component Analysis of data affected by Poisson noise. We assume samples are drawn from independent Poisson distributions. We want to estimate the principal components of a fixed transformation of the latent Poisson means. Our motivating example is microbiome data, though the methods apply to many other situations. We develop a semiparametric approach to correct the bias of variance estimators, both for untransformed and transformed (with particular attention to log-transformation) Poisson means. Furthermore, we incorporate methods for correcting for different exposure or sequencing depth in the data. In addition to identifying the principal components, we also address the non-trivial problem of computing the principal scores in this semiparametric framework. Most previous approaches take a more parametric line, for example the Poisson-log-normal (PLN) approach. We compare our method with the PLN approach and find that our method is better at identifying the main principal components of the latent log-transformed Poisson means and, as a further major advantage, takes far less time to compute. Comparing methods on real data, we see that our method also appears to be more robust to outliers than the parametric method.
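To make the bias in question concrete, here is a minimal sketch of the untransformed case only. It relies on the standard identity that, for counts X drawn as Poisson with latent means Λ, Cov(X) = Cov(Λ) + diag(E[Λ]), so subtracting the sample means from the diagonal of the sample covariance gives a moment-based estimate of Cov(Λ). This is an illustration under that assumption, not the paper's estimator: it does not cover the log-transformed case, sequencing-depth correction, or principal scores. The function names and the simulated data are hypothetical.

```python
import numpy as np

def poisson_corrected_covariance(counts):
    """Moment-based estimate of the covariance of the latent Poisson means.

    Assumes each column of `counts` is Poisson given its latent mean, so the
    Poisson noise adds E[mean] to each diagonal entry of the covariance.
    """
    counts = np.asarray(counts, dtype=float)
    sample_cov = np.cov(counts, rowvar=False)
    return sample_cov - np.diag(counts.mean(axis=0))

def principal_components(cov):
    """Eigenvalues and eigenvectors of a symmetric matrix, largest first."""
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]
    return eigvals[order], eigvecs[:, order]

# Simulated data (hypothetical): the latent means vary along one direction
# in the first two coordinates; the third coordinate has a large constant
# mean, so it carries a lot of Poisson noise but no latent variation.
rng = np.random.default_rng(0)
n = 20000
direction = np.array([1.0, 2.0, 0.0]) / np.sqrt(5.0)
base_means = np.array([10.0, 10.0, 50.0])
factor = rng.uniform(-6.0, 6.0, size=(n, 1))   # keeps all latent means positive
latent = base_means + factor * direction
counts = rng.poisson(latent)

_, naive_vecs = principal_components(np.cov(counts, rowvar=False))
_, corrected_vecs = principal_components(poisson_corrected_covariance(counts))

# Absolute cosine between each leading component and the true direction.
naive_alignment = abs(naive_vecs[:, 0] @ direction)
corrected_alignment = abs(corrected_vecs[:, 0] @ direction)
print("naive alignment:    ", naive_alignment)
print("corrected alignment:", corrected_alignment)
```

In this simulation the uncorrected PCA is drawn toward the high-count coordinate, because its Poisson variance is large even though its latent mean does not vary, while the corrected covariance recovers the direction of latent variation. The corrected matrix is not guaranteed to be positive semidefinite in finite samples, which is one reason a more careful treatment is needed in practice.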

research
04/03/2022

Robust PCA for High Dimensional Data based on Characteristic Transformation

In this paper, we propose a novel robust Principal Component Analysis (P...
research
10/24/2019

Robust Principal Component Analysis Based On Maximum Correntropy Power Iterations

Principal component analysis (PCA) is recognised as a quintessential dat...
research
02/05/2021

Robust Principal Component Analysis: A Median of Means Approach

Principal Component Analysis (PCA) is a fundamental tool for data visual...
research
08/16/2021

Flexible Principal Component Analysis for Exponential Family Distributions

Traditional principal component analysis (PCA) is well known in high-dim...
research
03/09/2022

Statistical Depth for Point Process via the Isometric Log-Ratio Transformation

Statistical depth, a useful tool to measure the center-outward rank of m...
research
05/26/2016

Suppressing Background Radiation Using Poisson Principal Component Analysis

Performance of nuclear threat detection systems based on gamma-ray spect...
research
03/22/2022

Dealing with Logs and Zeros in Regression Models

Log-linear models are prevalent in empirical research. Yet, how to handl...
