Normalized mutual information is a biased measure for classification and community detection

07/03/2023
by   Maximilian Jerdee, et al.
0

Normalized mutual information is widely used as a similarity measure for evaluating the performance of clustering and classification algorithms. In this paper, we show that results returned by the normalized mutual information are biased for two reasons: first, because they ignore the information content of the contingency table and, second, because their symmetric normalization introduces spurious dependence on algorithm output. We introduce a modified version of the mutual information that remedies both of these shortcomings. As a practical demonstration of the importance of using an unbiased measure, we perform extensive numerical tests on a basket of popular algorithms for network community detection and show that one's conclusions about which algorithm is best are significantly affected by the biases in the traditional mutual information.

READ FULL TEXT

page 8

page 10

page 11

research
01/15/2015

Evaluating accuracy of community detection using the relative normalized mutual information

The Normalized Mutual Information (NMI) has been widely used to evaluate...
research
07/29/2019

Improved mutual information measure for classification and community detection

The information theoretic quantity known as mutual information finds wid...
research
12/11/2019

Mutual Information in Community Detection with Covariate Information and Correlated Networks

We study the problem of community detection when there is covariate info...
research
05/19/2023

Unsupervised Scientific Abstract Segmentation with Normalized Mutual Information

The abstracts of scientific papers consist of premises and conclusions. ...
research
09/03/2018

Community detection analysis in wind speed-monitoring systems using mutual information-based complex network

A mutual information-based weighted network representation of a wide win...
research
07/24/2020

Approximately Optimal Binning for the Piecewise Constant Approximation of the Normalized Unexplained Variance (nUV) Dissimilarity Measure

The recently introduced Matching by Tone Mapping (MTM) dissimilarity mea...
research
09/17/2023

Conditional Mutual Information Constrained Deep Learning for Classification

The concepts of conditional mutual information (CMI) and normalized cond...

Please sign up or login with your details

Forgot password? Click here to reset