Listwise Deletion in High Dimensions

01/12/2021
by   J. Sophia Wang, et al.
0

We consider the properties of listwise deletion when both n and the number of variables grow large. We show that when (i) all data has some idiosyncratic missingness and (ii) the number of variables grows superlogarithmically in n, then, for large n, listwise deletion will drop all rows with probability 1. We present numerical illustrations to demonstrate finite-n implications. These results suggest, in practice, using listwise deletion may mean using few of the variables available to the researcher even when n is very large.

READ FULL TEXT
research
06/20/2018

Correcting an ordered deletion-erasure

In this paper, we show that the single deletion correcting Varshamov-Ten...
research
07/06/2020

Descent-to-Delete: Gradient-Based Methods for Machine Unlearning

We study the data deletion problem for convex models. By leveraging tech...
research
05/25/2022

Deletion and Insertion Tests in Regression Models

A basic task in explainable AI (XAI) is to identify the most important f...
research
01/15/2020

Insertion-Deletion Transformer

We propose the Insertion-Deletion Transformer, a novel transformer-based...
research
08/25/2023

ExD: Explainable Deletion

This paper focuses on a critical yet often overlooked aspect of data in ...
research
12/28/2020

On deletion diagnostic statistic in regression

A brief review about choosing a normalizing or scaling matrix for deleti...
research
02/18/2022

Debiasing Backdoor Attack: A Benign Application of Backdoor Attack in Eliminating Data Bias

Backdoor attack is a new AI security risk that has emerged in recent yea...

Please sign up or login with your details

Forgot password? Click here to reset