independence: Fast Rank Tests

10/19/2020
by   Chaim Even-Zohar, et al.
0

In 1948 Hoeffding devised a nonparametric test that detects dependence between two continuous random variables X and Y, based on the ranking of n paired samples (Xi,Yi). The computation of this commonly-used test statistic takes O(n log n) time. Hoeffding's test is consistent against any dependent probability density f(x,y), but can be fooled by other bivariate distributions with continuous margins. Variants of this test with full consistency have been considered by Blum, Kiefer, and Rosenblatt (1961), Yanagimoto (1970), Bergsma and Dassios (2010). The so far best known algorithms to compute these stronger independence tests have required quadratic time. Here we improve their run time to O(n log n), by elaborating on new methods for counting ranking patterns, from a recent paper by the author and Leng (SODA'21). Therefore, in all circumstances under which the classical Hoeffding independence test is applicable, we provide novel competitive algorithms for consistent testing against all alternatives. Our R package, independence, offers a highly optimized implementation of these rank-based tests. We demonstrate its capabilities on large-scale datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2018

Fast Conditional Independence Test for Vector Variables with Large Sample Sizes

We present and evaluate the Fast (conditional) Independence Test (FIT) -...
research
09/05/2007

Using Data Compressors to Construct Rank Tests

Nonparametric rank tests for homogeneity and component independence are ...
research
01/10/2013

A Bayesian Multiresolution Independence Test for Continuous Variables

In this paper we present a method ofcomputing the posterior probability ...
research
11/17/2020

A kernel test for quasi-independence

We consider settings in which the data of interest correspond to pairs o...
research
06/25/2016

Large-Scale Kernel Methods for Independence Testing

Representations of probability measures in reproducing kernel Hilbert sp...
research
08/09/2019

An Independence Test Based on Recurrence Rates

A new test of independence between random elements is presented in this ...
research
10/27/2021

Data-Driven Representations for Testing Independence: Modeling, Analysis and Connection with Mutual Information Estimation

This work addresses testing the independence of two continuous and finit...

Please sign up or login with your details

Forgot password? Click here to reset