The Generalized Mean Information Coefficient

08/26/2013
by Alexander Luedtke, et al.

Reshef & Reshef recently presented the Maximal Information Coefficient (MIC), a method that can detect all forms of statistical dependence between pairs of variables as the sample size goes to infinity. While this method has been praised by some, it has also been criticized for its lack of power in finite samples. We seek to modify MIC so that it has higher power to detect associations at limited sample sizes. Here we present the Generalized Mean Information Coefficient (GMIC), a generalization of MIC that incorporates a tuning parameter controlling the complexity of the association favored by the measure. We define GMIC and prove that it maintains several key asymptotic properties of MIC. We demonstrate its increased power over MIC in a simulation of eight different functional relationships at sixty different noise levels, and compare the results to the Pearson correlation, distance correlation, and MIC. The simulation results suggest that, while GMIC generally has slightly lower power than distance correlation, it achieves higher power than MIC for many forms of underlying association. For some functional relationships, GMIC surpasses all other statistics calculated. Preliminary results suggest that choosing a moderate value of the tuning parameter for GMIC will yield a test that is robust across underlying relationships. GMIC is a promising new method that mitigates the power issues suffered by MIC, at the possible expense of equitability. Nonetheless, in our simulations distance correlation was more powerful for many forms of underlying relationship. At a minimum, this work motivates further consideration of maximal information-based nonparametric exploration (MINE) methods as statistical tests of independence.
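The abstract does not state the formula for GMIC, so the following is only a generic illustration of how a generalized (power) mean with a tuning parameter behaves; it is not the paper's definition. The function name `generalized_mean`, the parameter `p`, and the sample scores are all assumptions made for this sketch. It shows the standard fact that the power mean equals the arithmetic mean at p = 1 and approaches the maximum as p grows, which is one way a single parameter can interpolate between an averaged statistic and a maximal one.

```python
import math


def generalized_mean(values, p):
    """Power mean of strictly positive values with exponent p.

    Illustrative only: p = 1 gives the arithmetic mean, p = 0 the
    geometric mean (the limiting case), and p = +inf the maximum.
    """
    if not values:
        raise ValueError("values must be non-empty")
    if any(v <= 0 for v in values):
        raise ValueError("values must be strictly positive")
    if p == math.inf:
        return max(values)
    if p == -math.inf:
        return min(values)
    if p == 0:
        # Limit of the power mean as p -> 0 is the geometric mean.
        return math.exp(sum(math.log(v) for v in values) / len(values))
    return (sum(v ** p for v in values) / len(values)) ** (1.0 / p)


# Hypothetical dependence scores, made up for this example.
scores = [0.2, 0.5, 0.9]
for p in (1, 2, 10, 50, math.inf):
    print(p, generalized_mean(scores, p))
```

Running the loop shows the result rising monotonically from the arithmetic mean toward the largest score as `p` increases, so smaller values of the parameter give more weight to the non-maximal scores.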

research
08/19/2013

Distance Correlation Methods for Discovering Associations in Large Astrophysical Databases

High-dimensional, large-sample astrophysical databases of galaxy cluster...
research
10/15/2021

Different coefficients for studying dependence

Through computer simulations, we research several different measures of ...
research
01/27/2013

Equitability Analysis of the Maximal Information Coefficient, with Comparisons

A measure of dependence is said to be equitable if it gives similar scor...
research
03/20/2017

Copula Index for Detecting Dependence and Monotonicity between Stochastic Signals

This paper introduces a nonparametric copula-based approach for detectin...
research
07/24/2020

A Nonparametric Test of Dependence Based on Ensemble of Decision Trees

In this paper, a robust non-parametric measure of statistical dependence...
research
01/16/2018

Compositional Correlation for Detecting Real Associations Among Time Series

Correlation remains to be one of the most widely used statistical tools ...
research
01/28/2022

Two more ways of spelling Gini Coefficient with Applications

In this paper, we draw attention to a promising yet slightly underestima...
