Information Estimation Using Non-Parametric Copulas

07/20/2018
by Houman Safaai, et al.

Estimation of mutual information between random variables has become crucial in a range of fields, from physics to neuroscience to finance. Estimating information accurately over a wide range of conditions requires flexible methods that describe statistical dependencies among variables without imposing potentially invalid assumptions on the data. Such methods are needed when prior knowledge of the data's statistical properties is lacking and when the number of samples is limited. Here we propose a powerful and generally applicable information estimator based on non-parametric copulas. This estimator, called the non-parametric copula-based estimator (NPC), is tailored to take into account detailed stochastic relationships in the data independently of the data's marginal distributions. The NPC estimator can be used for both continuous and discrete numerical variables and thus provides a single framework for estimating the mutual information of both kinds of data. Through extensive validation on artificial samples drawn from various statistical distributions, we found that the NPC estimator compares well against commonly used alternatives. Unlike methods not based on copulas, it yields information estimates that are robust to changes in the details of the marginal distributions. Unlike parametric copula methods, it remains accurate regardless of the precise form of the interactions between the variables. In addition, the NPC estimator produced accurate information estimates even at low sample numbers, in comparison to alternative estimators. The NPC estimator therefore provides a good balance between general applicability to arbitrarily shaped statistical dependencies in the data and accurate, robust performance with small sample sizes.
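The marginal-independence property described above can be illustrated with a small sketch. It relies on the standard identity that mutual information equals the negative entropy of the copula density, so the estimate depends only on the ranks of the data. This is not the NPC estimator itself: a plain 2D histogram of the rank-transformed samples is swapped in for the paper's non-parametric copula density estimate, the function names and the `bins` parameter are hypothetical, and continuous data without ties is assumed.

```python
import numpy as np


def empirical_copula_samples(x, y):
    """Rank-transform each variable to pseudo-observations in (0, 1).

    Assumes continuous data without ties (an assumption of this sketch).
    """
    n = len(x)
    u = (np.argsort(np.argsort(x)) + 1) / (n + 1)
    v = (np.argsort(np.argsort(y)) + 1) / (n + 1)
    return u, v


def copula_mi_histogram(x, y, bins=8):
    """Plug-in mutual information estimate (in nats).

    A histogram on the unit square stands in for the copula density;
    the paper's NPC estimator uses a different non-parametric estimate.
    """
    u, v = empirical_copula_samples(x, y)
    counts, _, _ = np.histogram2d(u, v, bins=bins, range=[[0, 1], [0, 1]])
    p = counts / counts.sum()  # probability mass per cell
    nz = p > 0
    # Copula density in a cell is p * bins**2; MI = sum of p * log(density).
    return float(np.sum(p[nz] * np.log(p[nz] * bins**2)))


# Usage: correlated Gaussian samples, then the same samples after
# strictly increasing transforms of each marginal.
rng = np.random.default_rng(0)
x = rng.standard_normal(5000)
y = 0.8 * x + 0.6 * rng.standard_normal(5000)

mi_original = copula_mi_histogram(x, y)
mi_transformed = copula_mi_histogram(np.exp(x), y**3)
print(mi_original, mi_transformed)
```

Because strictly increasing transforms leave the ranks unchanged, the two estimates in the usage example coincide, which is the robustness to marginal distributions that copula-based estimators provide. The histogram choice is crude and its bias depends on `bins` and on the sample size.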
