A More Powerful Two-Sample Test in High Dimensions using Random Projection

08/11/2011
by   Miles E. Lopes, et al.

We consider the hypothesis testing problem of detecting a shift between the means of two multivariate normal distributions in the high-dimensional setting, allowing for the data dimension p to exceed the sample size n. Specifically, we propose a new test statistic for the two-sample test of means that integrates a random projection with the classical Hotelling T^2 statistic. Working under a high-dimensional framework with (p,n) tending to infinity, we first derive an asymptotic power function for our test, and then provide sufficient conditions for it to achieve greater power than other state-of-the-art tests. Using ROC curves generated from synthetic data, we demonstrate superior performance against competing tests in the parameter regimes anticipated by our theoretical results. Lastly, we illustrate an advantage of our procedure's false positive rate with comparisons on high-dimensional gene expression data involving the discrimination of different types of cancer.
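To make the construction concrete, the following is a minimal sketch of a randomly projected Hotelling T^2 statistic, not the authors' implementation. It rests on several assumptions that the abstract does not state: the projection matrix has i.i.d. standard normal entries, the projected dimension k is chosen by the caller (k = floor((n1 + n2 - 2) / 2) is one plausible choice), and the function and variable names are hypothetical. It uses only the Python standard library, so the small linear solver stands in for a numerical library.

```python
import random

def mean_vec(rows):
    """Column-wise mean of a list of equal-length rows."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    m = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(m):
        piv = max(range(c, m), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, m):
            f = M[r][c] / M[c][c]
            for j in range(c, m + 1):
                M[r][j] -= f * M[c][j]
    x = [0.0] * m
    for r in range(m - 1, -1, -1):
        tail = sum(M[r][j] * x[j] for j in range(r + 1, m))
        x[r] = (M[r][m] - tail) / M[r][r]
    return x

def projected_hotelling(X, Y, k, seed=0):
    """Hotelling T^2 computed on data projected from p down to k dimensions.

    X, Y: lists of samples (rows of length p). Requires 1 <= k <= n1 + n2 - 2
    so that the projected pooled covariance is invertible.
    Returns (T^2, F-scaled statistic).
    """
    rng = random.Random(seed)
    n1, n2, p = len(X), len(Y), len(X[0])
    n = n1 + n2 - 2  # degrees of freedom of the pooled covariance
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n1 + n2 - 2")

    # Assumption: p x k projection with i.i.d. N(0, 1) entries.
    P = [[rng.gauss(0.0, 1.0) for _ in range(k)] for _ in range(p)]

    def project(rows):
        return [[sum(x[i] * P[i][j] for i in range(p)) for j in range(k)]
                for x in rows]

    Xp, Yp = project(X), project(Y)
    mx, my = mean_vec(Xp), mean_vec(Yp)
    d = [a - b for a, b in zip(mx, my)]  # projected mean difference

    # Pooled sample covariance of the projected data (k x k).
    S = [[0.0] * k for _ in range(k)]
    for rows, m in ((Xp, mx), (Yp, my)):
        for r in rows:
            c = [ri - mi for ri, mi in zip(r, m)]
            for a in range(k):
                for b in range(k):
                    S[a][b] += c[a] * c[b] / n

    w = solve(S, d)  # w = S^{-1} d
    t2 = n1 * n2 / (n1 + n2) * sum(di * wi for di, wi in zip(d, w))
    # Conditional on P and under Gaussian data with equal means,
    # this rescaling follows an F(k, n - k + 1) distribution.
    f_stat = (n - k + 1) / (k * n) * t2
    return t2, f_stat
```

Because the classical statistic is applied only after projecting to k dimensions, the pooled covariance stays invertible even when p exceeds the sample size, which is the situation the abstract targets. A p-value would come from comparing the second return value against an F(k, n - k + 1) reference distribution.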

