Data-Driven Representations for Testing Independence: Modeling, Analysis and Connection with Mutual Information Estimation

10/27/2021
by   Mauricio E. Gonzalez, et al.
0

This work addresses testing the independence of two continuous and finite-dimensional random variables from the design of a data-driven partition. The empirical log-likelihood statistic is adopted to approximate the sufficient statistics of an oracle test against independence (that knows the two hypotheses). It is shown that approximating the sufficient statistics of the oracle test offers a learning criterion for designing a data-driven partition that connects with the problem of mutual information estimation. Applying these ideas in the context of a data-dependent tree-structured partition (TSP), we derive conditions on the TSP's parameters to achieve a strongly consistent distribution-free test of independence over the family of probabilities equipped with a density. Complementing this result, we present finite-length results that show our TSP scheme's capacity to detect the scenario of independence structurally with the data-driven partition as well as new sampling complexity bounds for this detection. Finally, some experimental analyses provide evidence regarding our scheme's advantage for testing independence compared with some strategies that do not use data-driven representations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/18/2022

DIET: Conditional independence testing with marginal dependence measures of residual information

Conditional randomization tests (CRTs) assess whether a variable x is pr...
research
07/19/2022

A Normal Test for Independence via Generalized Mutual Information

Testing hypothesis of independence between two random elements on a join...
research
02/10/2020

A Test for Independence Via Bayesian Nonparametric Estimation of Mutual Information

Mutual information is a well-known tool to measure the mutual dependence...
research
11/17/2017

Nonparametric independence testing via mutual information

We propose a test of independence of two multivariate random vectors, gi...
research
11/09/2020

Near-Optimal Learning of Tree-Structured Distributions by Chow-Liu

We provide finite sample guarantees for the classical Chow-Liu algorithm...
research
10/19/2020

independence: Fast Rank Tests

In 1948 Hoeffding devised a nonparametric test that detects dependence b...
research
01/15/2020

Automated extraction of mutual independence patterns using Bayesian comparison of partition models

Mutual independence is a key concept in statistics that characterizes th...

Please sign up or login with your details

Forgot password? Click here to reset