Low-rank matrix denoising for count data using unbiased Kullback-Leibler risk estimation

01/28/2020
by   Jérémie Bigot, et al.
0

This paper is concerned by the analysis of observations organized in a matrix form whose elements are count data assumed to follow a Poisson or a multinomial distribution. We focus on the estimation of either the intensity matrix (Poisson case) or the compositional matrix (multinomial case) that is assumed to have a low rank structure. We propose to construct an estimator minimizing the regularized negative log-likelihood by a nuclear norm penalty. Our approach easily yields a low-rank matrix-valued estimator with positive entries which belongs to the set of row-stochastic matrices in the multinomial case. Then, our main contribution is to propose a data-driven way to select the regularization parameter in the construction of such estimators by minimizing (approximately) unbiased estimates of the Kullback-Leibler (KL) risk in such models, which generalize Stein's unbiased risk estimation originally proposed for Gaussian data. The evaluation of these quantities is a delicate problem, and we introduce novel methods to obtain accurate numerical approximation of such unbiased estimates. Simulated data are used to validate this way of selecting regularizing parameters for low-rank matrix estimation from count data. Examples from a survey study and metagenomics also illustrate the benefits of our approach for real data analysis.

READ FULL TEXT

page 13

page 22

page 23

page 24

page 25

page 26

page 29

page 31

research
07/11/2019

Low-rank matrix completion and denoising under Poisson noise

This paper considers the problem of estimating a low-rank matrix from th...
research
05/18/2015

Towards Faster Rates and Oracle Property for Low-Rank Matrix Estimation

We present a unified framework for low-rank matrix estimation with nonco...
research
05/28/2013

Adaptive estimation of the copula correlation matrix for semiparametric elliptical copulas

We study the adaptive estimation of copula correlation matrix Σ for the ...
research
02/17/2018

Nonparametric estimation of low rank matrix valued function

Let A:[0,1]→H_m (the space of Hermitian matrices) be a matrix valued fun...
research
06/15/2021

Multi-sample estimation of centered log-ratio matrix in microbiome studies

In microbiome studies, one of the ways of studying bacterial abundances ...
research
10/27/2021

Poisson PCA for matrix count data

We develop a dimension reduction framework for data consisting of matric...
research
11/11/2020

Predictive risk estimation for the Expectation Maximization algorithm with Poisson data

In this work, we introduce a novel estimator of the predictive risk with...

Please sign up or login with your details

Forgot password? Click here to reset