Hidden independence in unstructured probabilistic models

04/18/2020
by Antony Pearson, et al.

We describe a novel way to represent the probability distribution of a random binary string as a mixture having a maximally weighted component associated with independent (though not necessarily identically distributed) Bernoulli characters. We refer to this as the latent independent weight of the probabilistic source producing the string, and derive a combinatorial algorithm to compute it. The decomposition we propose may serve as an alternative to the Boolean paradigm of hypothesis testing, or to assess the fraction of uncorrupted samples originating from a source with independent marginals. In this sense, the latent independent weight quantifies the maximal amount of independence contained within a probabilistic source, which, properly speaking, may not have independent marginals.
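
To make the decomposition concrete, here is a minimal sketch, not the combinatorial algorithm the abstract refers to. It assumes the mixture has the form P = lam * Q + (1 - lam) * R, where Q is a product of independent Bernoulli marginals and R is an arbitrary remainder distribution; the symbols P, Q, R, lam and all function names are illustrative and do not come from the paper. For a fixed Q, the largest admissible lam is the minimum of P(x) / Q(x) over strings x with Q(x) > 0, because R must stay non-negative. The sketch then approximates the best Q by a brute-force grid search over the Bernoulli parameters, which is only practical for very short strings.

```python
from itertools import product


def product_pmf(params):
    """PMF of independent Bernoulli bits; params[i] is P(bit i = 1)."""
    pmf = {}
    for bits in product((0, 1), repeat=len(params)):
        prob = 1.0
        for bit, p in zip(bits, params):
            prob *= p if bit else 1.0 - p
        pmf[bits] = prob
    return pmf


def weight_for(pmf, params):
    """Largest lam such that pmf(x) >= lam * q(x) for every string x,
    where q is the product PMF with the given Bernoulli parameters."""
    q = product_pmf(params)
    return min(pmf.get(x, 0.0) / qx for x, qx in q.items() if qx > 0.0)


def grid_search_weight(pmf, n, steps=50):
    """Brute-force approximation (an assumption of this sketch, not the
    paper's method): try every parameter vector on a uniform grid."""
    grid = [i / steps for i in range(steps + 1)]
    best_lam, best_params = 0.0, None
    for params in product(grid, repeat=n):
        lam = weight_for(pmf, params)
        if lam > best_lam:
            best_lam, best_params = lam, params
    return best_lam, best_params


# Two perfectly correlated bits: only 00 and 11 occur.
correlated = {(0, 0): 0.5, (1, 1): 0.5}
lam, params = grid_search_weight(correlated, n=2)
print(lam, params)
```

In the perfectly correlated example, any product distribution that puts mass on 01 or 10 forces lam to zero, so the search settles on a point mass at one of the two observed strings. The grid search costs (steps + 1)^n evaluations of 2^n strings each, so it serves only as an illustration for small n.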

02/04/2022

Privacy-aware Distributed Hypothesis Testing in Gray-Wyner Network with Side Information

The problem of distributed binary hypothesis testing in the Gray-Wyner n...
02/19/2018

Distributed Hypothesis Testing Over Orthogonal Discrete Memoryless Channels

A distributed binary hypothesis testing problem is studied in which mult...
03/27/2020

Large independent sets on random d-regular graphs with d small

In this paper, we present a prioritized local algorithm that computes a ...
09/05/2007

Using Data Compressors to Construct Rank Tests

Nonparametric rank tests for homogeneity and component independence are ...
08/18/2018

The Capacity of Some Pólya String Models

We study random string-duplication systems, which we call Pólya string m...
07/11/2012

Case-Factor Diagrams for Structured Probabilistic Modeling

We introduce a probabilistic formalism subsuming Markov random fields of...
08/21/2020

Tuple-Independent Representations of Infinite Probabilistic Databases

Probabilistic databases (PDBs) are probability spaces over database inst...