Testing Independence of Exchangeable Random Variables

10/22/2022
by   Marcus Hutter, et al.
0

Given well-shuffled data, can we determine whether the data items are statistically (in)dependent? Formally, we consider the problem of testing whether a set of exchangeable random variables are independent. We will show that this is possible and develop tests that can confidently reject the null hypothesis that data is independent and identically distributed and have high power for (some) exchangeable distributions. We will make no structural assumptions on the underlying sample space. One potential application is in Deep Learning, where data is often scraped from the whole internet, with duplications abound, which can render data non-iid and test-set evaluation prone to give wrong answers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2018

Testing for exponentiality for stationary associated random variables

In this paper, we consider the problem of testing for exponentiality aga...
research
05/03/2022

Asymptotic Independence of the Sum and Maximum of Dependent Random Variables with Applications to High-Dimensional Tests

For a set of dependent random variables, without stationary or the stron...
research
04/19/2022

Asymptotic Independence of the Quadratic form and Maximum of Independent Random Variables with Applications to High-Dimensional Tests

This paper establishes the asymptotic independence between the quadratic...
research
04/01/2018

Smoothing-based tests with directional random variables

Testing procedures for assessing specific parametric model forms, or for...
research
08/15/2011

Selectivity in Probabilistic Causality: Drawing Arrows from Inputs to Stochastic Outputs

Given a set of several inputs into a system (e.g., independent variables...
research
02/22/2018

Multidimensional multiscale scanning in Exponential Families: Limit theory and statistical consequences

In this paper we consider the problem of finding anomalies in a d-dimens...
research
11/02/2022

Inferring independent sets of Gaussian variables after thresholding correlations

We consider testing whether a set of Gaussian variables, selected from t...

Please sign up or login with your details

Forgot password? Click here to reset