Generalized Multivariate Signs for Nonparametric Hypothesis Testing in High Dimensions

07/02/2021
by   Subhabrata Majumdar, et al.
0

High-dimensional data, where the dimension of the feature space is much larger than sample size, arise in a number of statistical applications. In this context, we construct the generalized multivariate sign transformation, defined as a vector divided by its norm. For different choices of the norm function, the resulting transformed vector adapts to certain geometrical features of the data distribution. Building up on this idea, we obtain one-sample and two-sample testing procedures for mean vectors of high-dimensional data using these generalized sign vectors. These tests are based on U-statistics using kernel inner products, do not require prohibitive assumptions, and are amenable to a fast randomization-based implementation. Through experiments in a number of data settings, we show that tests using generalized signs display higher power than existing tests, while maintaining nominal type-I error rates. Finally, we provide example applications on the MNIST and Minnesota Twin Studies genomic data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2019

A Nonparametric Normality Test for High-dimensional Data

Many statistical methodologies for high-dimensional data assume the popu...
research
08/11/2020

Test for mean matrix in GMANOVA model under heteroscedasticity and non-normality for high-dimensional data

This paper is concerned with the testing bilateral linear hypothesis on ...
research
04/01/2018

An overview of uniformity tests on the hypersphere

When modeling directional data, that is, unit-norm multivariate vectors,...
research
03/10/2020

A Pairwise Hotelling Method for Testing High-Dimensional Mean Vectors

For high-dimensional small sample size data, Hotelling's T2 test is not ...
research
03/14/2023

Adaptive Testing for High-dimensional Data

In this article, we propose a class of L_q-norm based U-statistics for a...
research
08/17/2018

Concentration Based Inference for High Dimensional (Generalized) Regression Models: New Phenomena in Hypothesis Testing

We develop simple and non-asymptotically justified methods for hypothesi...
research
08/12/2023

Spectral smooth tests for goodness-of-fit

Goodness-of-fit tests are crucial tools for assessing the validity of st...

Please sign up or login with your details

Forgot password? Click here to reset