Weak consistency of the 1-nearest neighbor measure with applications to missing data and covariate shift

02/06/2019
by   James Sharpnack, et al.
0

When data is partially missing at random, imputation and importance weighting are often used to estimate moments of the unobserved population. In this paper, we study 1-nearest neighbor (1NN) imputation, which replaces missing data with the complete data that is the nearest neighbor in the non-missing covariate space. We define an empirical measure, the 1NN measure, and show that it is weakly consistent for the measure of the missing data. The main idea behind this result is that 1NN imputation is performing inverse probability weighting in the limit. We study applications to missing data and assessing the impact of covariate shift in prediction tasks. We conclude with a discussion of using 1NN imputation for domain adaptation in order to alleviate the impact of covariate shift.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2021

Nearest Neighbor-based Importance Weighting

Importance weighting is widely applicable in machine learning in general...
research
02/23/2022

Nearest neighbor ratio imputation with incomplete multi-nomial outcome in survey sampling

Nonresponse is a common problem in survey sampling. Appropriate treatmen...
research
05/15/2022

Inference with Imputed Data: The Allure of Making Stuff Up

Incomplete observability of data generates an identification problem. Th...
research
11/03/2022

Domain Adaptation under Missingness Shift

Rates of missing data often depend on record-keeping policies and thus m...
research
05/15/2022

Imputations for High Missing Rate Data in Covariates via Semi-supervised Learning Approach

Advancements in data collection techniques and the heterogeneity of data...
research
01/07/2021

Distances with mixed type variables some modified Gower's coefficients

Nearest neighbor methods have become popular in official statistics, mai...
research
12/04/2015

Proposition of a Theoretical Model for Missing Data Imputation using Deep Learning and Evolutionary Algorithms

In the last couple of decades, there has been major advancements in the ...

Please sign up or login with your details

Forgot password? Click here to reset