KNIFE: Kernelized-Neural Differential Entropy Estimation

02/14/2022
by Georg Pichler, et al.

Mutual Information (MI) has been widely used as a loss regularizer for training neural networks. This has been particularly effective when learning disentangled or compressed representations of high-dimensional data. However, differential entropy (DE), another fundamental measure of information, has not found widespread use in neural network training. Although DE offers a potentially wider range of applications than MI, off-the-shelf DE estimators are either non-differentiable, computationally intractable, or unable to adapt to changes in the underlying distribution. These drawbacks prevent them from being used as regularizers in neural network training. To address the shortcomings of previously proposed DE estimators, we introduce KNIFE, a fully parameterized, differentiable kernel-based estimator of DE. The flexibility of our approach also allows us to construct KNIFE-based estimators for conditional DE (conditioned on either discrete or continuous variables), as well as for MI. We empirically validate our method on high-dimensional synthetic data and further apply it to guide the training of neural networks for real-world tasks. Our experiments on a large variety of tasks, including visual domain adaptation, textual fair classification, and textual fine-tuning, demonstrate the effectiveness of KNIFE-based estimation. Code can be found at https://github.com/g-pichler/knife.
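To make the idea of a kernel-based DE estimate concrete, the following is a minimal sketch and not the authors' implementation. It assumes one-dimensional data, equally weighted Gaussian kernels centered on a first batch of samples, and a fixed, hand-picked bandwidth. The estimate is the average negative log-density of a second batch under that kernel mixture. KNIFE, by contrast, is described above as fully parameterized and differentiable, so its kernel parameters would be trained rather than fixed. The function names and the bandwidth value are hypothetical.

```python
import math
import random


def gaussian_kernel_log_density(x, centers, bandwidth):
    """Log-density at x of an equally weighted 1-D Gaussian kernel mixture."""
    log_norm = -0.5 * math.log(2.0 * math.pi) - math.log(bandwidth)
    log_terms = [log_norm - 0.5 * ((x - c) / bandwidth) ** 2 for c in centers]
    # Log-sum-exp for numerical stability, then divide by the number of kernels.
    largest = max(log_terms)
    log_sum = largest + math.log(sum(math.exp(t - largest) for t in log_terms))
    return log_sum - math.log(len(centers))


def kernel_entropy_estimate(eval_points, centers, bandwidth):
    """Average negative log-density of eval_points under the kernel mixture."""
    total = sum(gaussian_kernel_log_density(x, centers, bandwidth) for x in eval_points)
    return -total / len(eval_points)


# Assumed toy setup: samples from a standard normal, whose differential
# entropy is known in closed form as 0.5 * log(2 * pi * e).
rng = random.Random(0)
centers = [rng.gauss(0.0, 1.0) for _ in range(500)]
eval_points = [rng.gauss(0.0, 1.0) for _ in range(500)]

estimate = kernel_entropy_estimate(eval_points, centers, bandwidth=0.3)
true_entropy = 0.5 * math.log(2.0 * math.pi * math.e)
print(f"kernel estimate: {estimate:.3f}, closed form: {true_entropy:.3f}")
```

Evaluating on samples that are separate from the kernel centers avoids the optimistic bias of scoring each point under a kernel centered on itself. In a trainable variant, the bandwidth, kernel locations, and weights would be the parameters adjusted by gradient descent on this same quantity.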

research
12/21/2020

Neural Joint Entropy Estimation

Estimating the entropy of a discrete random variable is a fundamental pr...
research
02/21/2023

Scalable Infomin Learning

The task of infomin learning aims to learn a representation with high ut...
research
11/20/2022

Diffeomorphic Information Neural Estimation

Mutual Information (MI) and Conditional Mutual Information (CMI) are mul...
research
05/08/2023

High-Dimensional Smoothed Entropy Estimation via Dimensionality Reduction

We study the problem of overcoming exponential sample complexity in diff...
research
05/08/2021

Understanding Neural Networks with Logarithm Determinant Entropy Estimator

Understanding the informative behaviour of deep neural networks is chall...
research
03/02/2021

Fast Adaptation with Linearized Neural Networks

The inductive biases of trained neural networks are difficult to underst...
research
05/25/2023

Non-adversarial training of Neural SDEs with signature kernel scores

Neural SDEs are continuous-time generative models for sequential data. S...
