On Suspicious Coincidences and Pointwise Mutual Information

Barlow (1985) hypothesized that the co-occurrence of two events A and B is "suspicious" if P(A,B) ≫ P(A) P(B). We first review classical measures of association for 2 × 2 contingency tables, including Yule's Y (Yule, 1912), which depends only on the odds ratio λ, and is independent of the marginal probabilities of the table. We then discuss the mutual information (MI) and pointwise mutual information (PMI), which depend on the ratio P(A,B)/P(A)P(B), as measures of association. We show that, once the effect of the marginals is removed, MI and PMI behave similarly to Y as functions of λ. The pointwise mutual information is used extensively in some research communities for flagging suspicious coincidences, but it is important to bear in mind the sensitivity of the PMI to the marginals, with increased scores for sparser events.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/21/2021

On Study of Mutual Information and its Estimation Methods

The presence of mutual information in the research of deep learning has ...
research
02/08/2022

On Sibson's α-Mutual Information

We explore a family of information measures that stems from Rényi's α-Di...
research
10/01/2008

Determining the Unithood of Word Sequences using Mutual Information and Independence Measure

Most works related to unithood were conducted as part of a larger effort...
research
01/04/2018

Modeling Log-linear and Logit Models in Categorical Data Analysis

The association between categorical variables is analyzed using the mutu...
research
09/04/2018

Pointwise HSIC: A Linear-Time Kernelized Co-occurrence Norm for Sparse Linguistic Expressions

In this paper, we propose a new kernel-based co-occurrence measure that ...
research
01/25/2018

Hardware implementation of auto-mutual information function for condition monitoring

This study is aimed at showing applicability of mutual information, name...
research
03/23/2021

Pairwise Adjusted Mutual Information

A well-known metric for quantifying the similarity between two clusterin...

Please sign up or login with your details

Forgot password? Click here to reset