Large-Scale Multiple Testing for Matrix-Valued Data under Double Dependency

06/17/2021
by   Xu Han, et al.
0

High-dimensional inference based on matrix-valued data has drawn increasing attention in modern statistical research, yet not much progress has been made in large-scale multiple testing specifically designed for analysing such data sets. Motivated by this, we consider in this article an electroencephalography (EEG) experiment that produces matrix-valued data and presents a scope of developing novel matrix-valued data based multiple testing methods controlling false discoveries for hypotheses that are of importance in such an experiment. The row-column cross-dependency of observations appearing in a matrix form, referred to as double-dependency, is one of the main challenges in the development of such methods. We address it by assuming matrix normal distribution for the observations at each of the independent matrix data-points. This allows us to fully capture the underlying double-dependency informed through the row- and column-covariance matrices and develop methods that are potentially more powerful than the corresponding one (e.g., Fan and Han (2017)) obtained by vectorizing each data point and thus ignoring the double-dependency. We propose two methods to approximate the false discovery proportion with statistical accuracy. While one of these methods is a general approach under double-dependency, the other one provides more computational efficiency for higher dimensionality. Extensive numerical studies illustrate the superior performance of the proposed methods over the principal factor approximation method of Fan and Han (2017). The proposed methods have been further applied to the aforementioned EEG data.

READ FULL TEXT

page 32

page 33

research
03/23/2020

Projected Estimation for Large-dimensional Matrix Factor Models

Large-dimensional factor models are drawing growing attention and widely...
research
04/11/2020

Covariance Estimation for Matrix-valued Data

Covariance estimation for matrix-valued data has received an increasing ...
research
01/01/2023

Iterative Least Squares Algorithm for Large-dimensional Matrix Factor Model by Random Projection

The matrix factor model has drawn growing attention for its advantage in...
research
02/24/2022

Multiple multi-sample testing under arbitrary covariance dependency

Modern high-throughput biomedical devices routinely produce data on a la...
research
08/18/2021

Multiple two-sample testing under arbitrary covariance dependency with an application in imaging mass spectrometry

Large-scale hypothesis testing has become a ubiquitous problem in high-d...
research
06/30/2020

Testing and Support Recovery of Correlation Structures for Matrix-Valued Observations with an Application to Stock Market Data

Estimation of the covariance matrix of asset returns is crucial to portf...
research
01/11/2022

The Poisson Multinomial Distribution and Its Applications in Voting Theory, Ecological Inference, and Machine Learning

The Poisson multinomial distribution (PMD) describes the distribution of...

Please sign up or login with your details

Forgot password? Click here to reset