Bayesian Optimal Two-sample Tests in High-dimension

12/05/2021
by Kyoungjae Lee, et al.

We propose optimal Bayesian two-sample tests for the equality of high-dimensional mean vectors and covariance matrices between two populations. In many applications, including genomics and medical imaging, it is natural to assume that only a few entries of the two mean vectors or covariance matrices differ. Many existing tests that aggregate differences between empirical means or covariance matrices are suboptimal or have low power in such settings. Motivated by this, we develop Bayesian two-sample tests based on a divide-and-conquer idea, which is especially powerful when the difference between the two populations is sparse but large. The proposed tests have closed-form Bayes factors and allow scalable computation even in high dimensions. We prove that the proposed tests are consistent under relatively mild conditions compared to existing tests in the literature. Furthermore, the testable regions of the proposed tests turn out to be rate-optimal. Simulation studies show clear advantages of the proposed tests over other state-of-the-art methods in various scenarios. We also apply our tests to gene expression data from two cancer data sets.
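To make the divide-and-conquer idea concrete, the following is a minimal sketch for the mean-vector case, not the paper's actual test. It assumes a simple stand-in model: each coordinate is tested separately with a normal prior on the mean difference and a plug-in pooled variance, which gives a closed-form Bayes factor per coordinate, and the per-coordinate log Bayes factors are then combined by taking their maximum. The function name `max_log_bayes_factor`, the prior scale `tau2`, and the plug-in variance are all illustrative assumptions, not taken from the paper.

```python
import numpy as np


def max_log_bayes_factor(x, y, tau2=1.0):
    """Illustrative max-of-coordinatewise log Bayes factors (assumed model).

    x, y : arrays of shape (n1, p) and (n2, p), one row per observation.
    tau2 : hypothetical prior scale; under H1 the mean difference in
           coordinate j is given a N(0, tau2 * s2_j) prior.
    Returns (largest log Bayes factor, its coordinate index, all log BFs).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n1, n2 = x.shape[0], y.shape[0]

    # Coordinate-wise difference of sample means.
    d = x.mean(axis=0) - y.mean(axis=0)

    # Plug-in pooled variance per coordinate (an assumption of this sketch).
    s2 = ((n1 - 1) * x.var(axis=0, ddof=1)
          + (n2 - 1) * y.var(axis=0, ddof=1)) / (n1 + n2 - 2)

    # Variance of d under H0 (no difference) and under H1 (prior added).
    v0 = s2 * (1.0 / n1 + 1.0 / n2)
    v1 = v0 + tau2 * s2

    # Closed-form log ratio of the N(0, v1) and N(0, v0) densities at d.
    log_bf = 0.5 * np.log(v0 / v1) + 0.5 * d**2 * (1.0 / v0 - 1.0 / v1)

    j = int(np.argmax(log_bf))
    return float(log_bf[j]), j, log_bf


# Usage: two samples that differ in a single coordinate (sparse but large).
rng = np.random.default_rng(0)
x = rng.normal(size=(50, 200))
y = rng.normal(size=(50, 200))
y[:, 7] += 3.0
best, idx, _ = max_log_bayes_factor(x, y)
```

Because the statistic is a maximum over coordinates rather than a sum, a single large difference is not diluted by the many coordinates where the two populations agree, which is the intuition behind targeting sparse but large signals.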


