A Normal Test for Independence via Generalized Mutual Information

07/19/2022
by Jialin Zhang, et al.

Testing the hypothesis of independence between two random elements on a joint alphabet is a fundamental exercise in statistics. Pearson's chi-squared test is effective in this setting when the contingency table is relatively small, but general statistical tools are lacking when the table is large or sparse. This article derives and proposes a test based on generalized mutual information. The new test has two desirable theoretical properties. First, the test statistic is asymptotically normal under the hypothesis of independence; consequently, it does not require knowledge of the row and column sizes of the contingency table. Second, the test is consistent and therefore detects any form of dependence structure in the general alternative space, given a sufficiently large sample. In addition, simulation studies show that the proposed test converges faster than Pearson's chi-squared test when the contingency table is large or sparse.
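
For orientation, the baseline named in the abstract can be sketched in a few lines of Python. The sketch computes Pearson's chi-squared statistic, its degrees of freedom, and the ordinary plug-in (Shannon) mutual information of a contingency table. It is not the generalized mutual information statistic proposed in the article, whose definition is not given in the abstract; the function names and the example table are illustrative assumptions. Note that the chi-squared reference distribution depends on the table dimensions through the degrees of freedom, which is the requirement the abstract says the normal test avoids.

```python
import math


def expected_counts(table):
    """Expected cell counts under independence: row_total * col_total / n."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    return [[r * c / n for c in col_totals] for r in row_totals]


def pearson_chi2(table):
    """Pearson's chi-squared statistic: sum of (observed - expected)^2 / expected."""
    expected = expected_counts(table)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
        if e > 0  # skip cells in all-zero rows or columns
    )


def chi2_degrees_of_freedom(table):
    """Degrees of freedom (rows - 1) * (cols - 1); needs the table dimensions."""
    return (len(table) - 1) * (len(table[0]) - 1)


def plugin_mutual_information(table):
    """Plug-in Shannon mutual information in nats (NOT the article's statistic)."""
    n = sum(sum(row) for row in table)
    expected = expected_counts(table)
    return sum(
        (o / n) * math.log(o / e)
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
        if o > 0  # empty cells contribute zero by convention
    )


# Hypothetical 2x2 table of counts, used only for illustration.
example = [[10, 20], [20, 10]]
chi2 = pearson_chi2(example)
dof = chi2_degrees_of_freedom(example)
mi = plugin_mutual_information(example)
print(f"chi2={chi2:.4f}, dof={dof}, plug-in MI={mi:.4f} nats")
```

When the table is large or sparse, many expected counts fall near zero, and the chi-squared approximation for this statistic becomes unreliable; that is the regime the abstract targets.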

research
01/26/2021

USP: an independence test that improves on Pearson's chi-squared and the G-test

We present the U-Statistic Permutation (USP) test of independence in the...
research
11/17/2017

Nonparametric independence testing via mutual information

We propose a test of independence of two multivariate random vectors, gi...
research
10/27/2021

Data-Driven Representations for Testing Independence: Modeling, Analysis and Connection with Mutual Information Estimation

This work addresses testing the independence of two continuous and finit...
research
07/10/2021

A test for normality and independence based on characteristic function

In this article we prove a generalization of the Ejsmont characterizatio...
research
08/28/2018

Seven proofs of the Pearson Chi-squared independence test and its graphical interpretation

This paper revisits the Pearson Chi-squared independence test. After pre...
research
02/23/2021

Goodness-of-fit Test on the Number of Biclusters in Relational Data Matrix

Biclustering is a method for detecting homogeneous submatrices in a give...
research
03/10/2021

Extension of the Lagrange multiplier test for error cross-section independence to large panels with non normal errors

This paper reexamines the seminal Lagrange multiplier test for cross-sec...
