On Approximating Total Variation Distance

06/14/2022
by   Arnab Bhattacharyya, et al.
0

Total variation distance (TV distance) is a fundamental notion of distance between probability distributions. In this work, we introduce and study the computational problem of determining the TV distance between two product distributions over the domain {0,1}^n. We establish the following results. 1. Exact computation of TV distance between two product distributions is #𝖯-complete. This is in stark contrast with other distance measures such as KL, Chi-square, and Hellinger which tensorize over the marginals. 2. Given two product distributions P and Q with marginals of P being at least 1/2 and marginals of Q being at most the respective marginals of P, there exists a fully polynomial-time randomized approximation scheme (FPRAS) for computing the TV distance between P and Q. In particular, this leads to an efficient approximation scheme for the interesting case when P is an arbitrary product distribution and Q is the uniform distribution. We pose the question of characterizing the complexity of approximating the TV distance between two arbitrary product distributions as a basic open problem in computational statistics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2023

Total Variation Distance Estimation Is as Easy as Probabilistic Inference

In this paper, we establish a novel connection between total variation (...
research
08/01/2022

A simple polynomial-time approximation algorithm for the total variation distance between two product distributions

We give a simple polynomial-time approximation algorithm for the total v...
research
05/23/2023

Statistical Indistinguishability of Learning Algorithms

When two different parties use the same learning rule on their own data,...
research
02/10/2019

The Optimal Approximation Factor in Density Estimation

Consider the following problem: given two arbitrary densities q_1,q_2 an...
research
01/21/2020

When does the Tukey median work?

We analyze the performance of the Tukey median estimator under total var...
research
04/19/2022

Independence Testing for Bounded Degree Bayesian Network

We study the following independence testing problem: given access to sam...
research
04/22/2016

Learning a Tree-Structured Ising Model in Order to Make Predictions

We study the problem of learning a tree graphical model from samples suc...

Please sign up or login with your details

Forgot password? Click here to reset