Two-sample Behrens–Fisher problems for high-dimensional data: a normal reference F-type test

12/27/2022
by   Tianming Zhu, et al.
0

The problem of testing the equality of mean vectors for high-dimensional data has been intensively investigated in the literature. However, most of the existing tests impose strong assumptions on the underlying group covariance matrices which may not be satisfied or hardly be checked in practice. In this article, an F-type test for two-sample Behrens–Fisher problems for high-dimensional data is proposed and studied. When the two samples are normally distributed and when the null hypothesis is valid, the proposed F-type test statistic is shown to be an F-type mixture, a ratio of two independent chi-square-type mixtures. Under some regularity conditions and the null hypothesis, it is shown that the proposed F-type test statistic and the above F-type mixture have the same normal and non-normal limits. It is then justified to approximate the null distribution of the proposed F-type test statistic by that of the F-type mixture, resulting in the so-called normal reference F-type test. Since the F-type mixture is a ratio of two independent chi-square-type mixtures, we employ the Welch–Satterthwaite chi-square-approximation to the distributions of the numerator and the denominator of the F-type mixture respectively, resulting in an approximation F-distribution whose degrees of freedom can be consistently estimated from the data. The asymptotic power of the proposed F-type test is established. Two simulation studies are conducted and they show that in terms of size control, the proposed F-type test outperforms two existing competitors. The proposed F-type test is also illustrated by a real data example.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/23/2022

Two-Sample Test for High-Dimensional Covariance Matrices: a normal-reference approach

Testing the equality of the covariance matrices of two high-dimensional ...
research
08/02/2022

The nonparametric Behrens-Fisher problem in small samples

While there appears to be a general consensus in the literature on the d...
research
07/03/2023

Sparse approximation for t-statistics

In the signal plus noise model, it is of interest to quantify the eviden...
research
01/03/2023

Testing High-dimensional Multinomials with Applications to Text Analysis

Motivated by applications in text mining and discrete distribution infer...
research
07/18/2023

A Shrinkage Likelihood Ratio Test for High-Dimensional Subgroup Analysis with a Logistic-Normal Mixture Model

In subgroup analysis, testing the existence of a subgroup with a differe...
research
04/11/2019

Comparing a Large Number of Multivariate Distributions

In this paper, we propose a test for the equality of multiple distributi...
research
07/20/2021

Directional testing for high-dimensional multivariate normal distributions

Thanks to its favorable properties, the multivariate normal distribution...

Please sign up or login with your details

Forgot password? Click here to reset