Generating knockoffs via conditional independence

12/19/2022
by Emanuela Dreassi, et al.

Let X be a p-variate random vector and X̃ a knockoff copy of X (in the sense of <cit.>). A new approach for constructing X̃ (henceforth, NA) has been introduced in <cit.>. NA has essentially three advantages: (i) building X̃ is straightforward; (ii) the joint distribution of (X, X̃) can be written in closed form; (iii) X̃ is often optimal under various criteria, including mean absolute correlation and reconstructability. However, for NA to apply, the distribution of X needs to be of the form (*) P(X_1 ∈ A_1, …, X_p ∈ A_p) = E{∏_{i=1}^p P(X_i ∈ A_i | Z)} for some random element Z. Our first result is that any probability measure μ on ℝ^p can be approximated by a probability measure μ_0 which makes condition (*) true. If μ is absolutely continuous, the approximation holds in total variation distance. In applications, regarding μ as the distribution of X, this result suggests using the knockoffs based on μ_0 instead of those based on μ (which are generally unknown). Our second result is a characterization of the pairs (X, X̃) where X̃ is obtained via NA. It turns out that (X, X̃) is of this type if and only if it can be extended to an infinite sequence so as to satisfy certain invariance conditions. The basic tool for proving this fact is de Finetti's theorem for partially exchangeable sequences.
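To make condition (*) concrete, here is a minimal Python sketch, not the authors' implementation. It assumes a hypothetical finite Gaussian mixture: Z takes values in {0, …, K-1} with probabilities `weights`, and, given Z = k, the coordinates of X are independent N(`means[k, i]`, `sds[k, i]`²). It also assumes, since the abstract does not spell out the construction, that a knockoff is drawn by sampling Z from its conditional distribution given X and then drawing each X̃_i from the conditional law of X_i given Z. The function names `posterior_z` and `sample_knockoff` are illustrative only.

```python
import numpy as np


def posterior_z(x, weights, means, sds):
    """Posterior probabilities P(Z = k | X = x) for the assumed mixture."""
    x = np.asarray(x, dtype=float)
    # Log-density of each coordinate under each component, shape (K, p);
    # additive constants common to all components are dropped.
    log_lik = -0.5 * ((x - means) / sds) ** 2 - np.log(sds)
    log_post = np.log(weights) + log_lik.sum(axis=1)
    log_post -= log_post.max()  # stabilise before exponentiating
    post = np.exp(log_post)
    return post / post.sum()


def sample_knockoff(x, weights, means, sds, rng):
    """Draw one candidate knockoff of x: sample Z | X = x, then draw each
    coordinate independently from its conditional law given Z."""
    post = posterior_z(x, weights, means, sds)
    k = rng.choice(len(weights), p=post)
    return rng.normal(means[k], sds[k])


# Hypothetical parameters: K = 2 components, p = 3 coordinates.
weights = np.array([0.4, 0.6])
means = np.array([[0.0, 1.0, -1.0],
                  [2.0, -2.0, 0.5]])
sds = np.array([[1.0, 0.5, 2.0],
                [0.7, 1.5, 1.0]])

rng = np.random.default_rng(0)
z = rng.choice(len(weights), p=weights)   # latent component
x = rng.normal(means[z], sds[z])          # one draw of X
x_knock = sample_knockoff(x, weights, means, sds, rng)
```

Under these assumptions, X_i and X̃_i are conditionally i.i.d. given Z, so swapping any subset of coordinates between X and X̃ leaves the joint distribution unchanged; and since X̃ is generated from X alone, it carries no extra information about a response.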

research
04/15/2021

New perspectives on knockoffs construction

Let Λ be the collection of all probability distributions for (X, X̃), wher...
research
09/26/2018

A new Gini correlation between quantitative and qualitative variables

We propose a new Gini correlation to measure dependence between a catego...
research
04/08/2023

De Finetti's Theorem and Related Results for Infinite Weighted Exchangeable Sequences

De Finetti's theorem, also called the de Finetti-Hewitt-Savage theorem, ...
research
04/22/2021

Bayesian predictive inference without a prior

Let (X_n:n≥ 1) be a sequence of random observations. Let σ_n(·)=P(X_n+1∈...
research
09/25/2009

Discrete MDL Predicts in Total Variation

The Minimum Description Length (MDL) principle selects the model that ha...
research
06/06/2023

Parametrization, Prior Independence, and Posterior Asymptotic Normality in the Partially Linear Model

We prove a semiparametric Bernstein-von Mises theorem for a partially li...
