Using Data Compressors to Construct Rank Tests

09/05/2007
by   Daniil Ryabko, et al.
0

Nonparametric rank tests for homogeneity and component independence are proposed, which are based on data compressors. For homogeneity testing the idea is to compress the binary string obtained by ordering the two joint samples and writing 0 if the element is from the first sample and 1 if it is from the second sample and breaking ties by randomization (extension to the case of multiple samples is straightforward). H_0 should be rejected if the string is compressed (to a certain degree) and accepted otherwise. We show that such a test obtained from an ideal data compressor is valid against all alternatives. Component independence is reduced to homogeneity testing by constructing two samples, one of which is the first half of the original and the other is the second half with one of the components randomly permuted.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2020

independence: Fast Rank Tests

In 1948 Hoeffding devised a nonparametric test that detects dependence b...
research
04/29/2021

Nonparametric tests of independence for circular data based on trigonometric moments

We introduce nonparametric tests of independence for bivariate circular ...
research
04/29/2023

Sequential Predictive Two-Sample and Independence Testing

We study the problems of sequential nonparametric two-sample and indepen...
research
04/08/2018

Fast Conditional Independence Test for Vector Variables with Large Sample Sizes

We present and evaluate the Fast (conditional) Independence Test (FIT) -...
research
01/13/2023

Randomization Tests for Adaptively Collected Data

Randomization testing is a fundamental method in statistics, enabling in...
research
04/18/2020

Hidden independence in unstructured probabilistic models

We describe a novel way to represent the probability distribution of a r...

Please sign up or login with your details

Forgot password? Click here to reset