Estimating Conditional Mutual Information for Discrete-Continuous Mixtures using Multi-Dimensional Adaptive Histograms

01/13/2021
by   Alexander Marx, et al.
0

Estimating conditional mutual information (CMI) is an essential yet challenging step in many machine learning and data mining tasks. Estimating CMI from data that contains both discrete and continuous variables, or even discrete-continuous mixture variables, is a particularly hard problem. In this paper, we show that CMI for such mixture variables, defined based on the Radon-Nikodym derivate, can be written as a sum of entropies, just like CMI for purely discrete or continuous data. Further, we show that CMI can be consistently estimated for discrete-continuous mixture variables by learning an adaptive histogram model. In practice, we estimate such a model by iteratively discretizing the continuous data points in the mixture variables. To evaluate the performance of our estimator, we benchmark it against state-of-the-art CMI estimators as well as evaluate it in a causal discovery setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2019

Conditional Mutual Information Estimation for Mixed Discrete and Continuous Variables with Nearest Neighbors

Fields like public health, public policy, and social science often want ...
research
11/14/2022

Generalized Stable Weights via Neural Gibbs Density

We present a generalized balancing weight method fully available for est...
research
11/20/2022

Diffeomorphic Information Neural Estimation

Mutual Information (MI) and Conditional Mutual Information (CMI) are mul...
research
06/30/2019

Estimating Information-Theoretic Quantities with Random Forests

Information-theoretic quantities, such as mutual information and conditi...
research
10/26/2021

Estimating Mutual Information via Geodesic kNN

Estimating mutual information (MI) between two continuous random variabl...
research
11/25/2017

Feature Selection Facilitates Learning Mixtures of Discrete Product Distributions

Feature selection can facilitate the learning of mixtures of discrete ra...
research
04/01/2021

Reconciling the Discrete-Continuous Divide: Towards a Mathematical Theory of Sparse Communication

Neural networks and other machine learning models compute continuous rep...

Please sign up or login with your details

Forgot password? Click here to reset