DeepAI AI Chat
Log In Sign Up

Two-sample Behrens–Fisher problems for high-dimensional data: a normal reference F-type test

by   Tianming Zhu, et al.

The problem of testing the equality of mean vectors for high-dimensional data has been intensively investigated in the literature. However, most of the existing tests impose strong assumptions on the underlying group covariance matrices which may not be satisfied or hardly be checked in practice. In this article, an F-type test for two-sample Behrens–Fisher problems for high-dimensional data is proposed and studied. When the two samples are normally distributed and when the null hypothesis is valid, the proposed F-type test statistic is shown to be an F-type mixture, a ratio of two independent chi-square-type mixtures. Under some regularity conditions and the null hypothesis, it is shown that the proposed F-type test statistic and the above F-type mixture have the same normal and non-normal limits. It is then justified to approximate the null distribution of the proposed F-type test statistic by that of the F-type mixture, resulting in the so-called normal reference F-type test. Since the F-type mixture is a ratio of two independent chi-square-type mixtures, we employ the Welch–Satterthwaite chi-square-approximation to the distributions of the numerator and the denominator of the F-type mixture respectively, resulting in an approximation F-distribution whose degrees of freedom can be consistently estimated from the data. The asymptotic power of the proposed F-type test is established. Two simulation studies are conducted and they show that in terms of size control, the proposed F-type test outperforms two existing competitors. The proposed F-type test is also illustrated by a real data example.


page 1

page 2

page 3

page 4


Two-Sample Test for High-Dimensional Covariance Matrices: a normal-reference approach

Testing the equality of the covariance matrices of two high-dimensional ...

The nonparametric Behrens-Fisher problem in small samples

While there appears to be a general consensus in the literature on the d...

Sparse approximation for t-statistics

In the signal plus noise model, it is of interest to quantify the eviden...

Testing High-dimensional Multinomials with Applications to Text Analysis

Motivated by applications in text mining and discrete distribution infer...

A Shrinkage Likelihood Ratio Test for High-Dimensional Subgroup Analysis with a Logistic-Normal Mixture Model

In subgroup analysis, testing the existence of a subgroup with a differe...

Comparing a Large Number of Multivariate Distributions

In this paper, we propose a test for the equality of multiple distributi...

Directional testing for high-dimensional multivariate normal distributions

Thanks to its favorable properties, the multivariate normal distribution...