An information criterion for auxiliary variable selection in incomplete data analysis

02/21/2019
by Shinpei Imori, et al.

We consider statistical inference for variables of interest, called primary variables, when auxiliary variables are observed along with them. The setting is incomplete data analysis, in which some primary variables are not observed. By using a parametric model of the joint distribution of primary and auxiliary variables, the estimate of the parametric model for the primary variables can be improved when the auxiliary variables are closely related to the primary variables. When the auxiliary variables are irrelevant to the primary variables, however, estimation accuracy decreases. To select useful auxiliary variables, we formulate the problem as model selection and propose an information criterion for predicting primary variables by leveraging auxiliary variables. Under some reasonable conditions, the proposed information criterion is an asymptotically unbiased estimator of the Kullback-Leibler divergence for complete data of primary variables. We also establish an asymptotic equivalence between the proposed information criterion and a variant of leave-one-out cross-validation. The performance of our method is demonstrated through a simulation study and a real data example.
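The trade-off described in the abstract (a related auxiliary variable helps, an irrelevant one hurts) can be illustrated with a small simulation. This is a toy sketch only and is not the information criterion proposed in the paper: it assumes a bivariate normal model, a primary variable `y` observed for a random subset of units, and an auxiliary variable `x` observed for all units, and it compares the complete-case mean of `y` with a regression-assisted estimate. The function name `simulate_mse` and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np

def simulate_mse(rho, n=40, n_obs=10, n_rep=4000, seed=0):
    """Monte Carlo MSE of two estimators of E[y] = 0.

    Assumed setup (illustrative, not from the paper):
    x is observed for all n units, y only for n_obs units chosen
    completely at random, and (x, y) is standard bivariate normal
    with correlation rho.
    Returns (mse_complete_case, mse_with_auxiliary).
    """
    rng = np.random.default_rng(seed)
    err_cc = np.empty(n_rep)
    err_aux = np.empty(n_rep)
    for r in range(n_rep):
        x = rng.standard_normal(n)
        y = rho * x + np.sqrt(1.0 - rho**2) * rng.standard_normal(n)
        # Indices of the units whose primary variable is observed.
        idx = rng.permutation(n)[:n_obs]
        x_obs, y_obs = x[idx], y[idx]
        # Estimator 1: ignore the auxiliary variable.
        mu_cc = y_obs.mean()
        # Estimator 2: fit y on x in the observed part, then adjust
        # using the mean of x over all n units.
        slope = np.cov(x_obs, y_obs)[0, 1] / np.var(x_obs, ddof=1)
        mu_aux = mu_cc + slope * (x.mean() - x_obs.mean())
        # The true mean of y is 0, so the squared error is the estimate squared.
        err_cc[r] = mu_cc**2
        err_aux[r] = mu_aux**2
    return err_cc.mean(), err_aux.mean()

# Compare a strongly related and an irrelevant auxiliary variable.
for rho in (0.9, 0.0):
    mse_cc, mse_aux = simulate_mse(rho)
    print(f"rho={rho}: complete-case MSE={mse_cc:.4f}, with auxiliary MSE={mse_aux:.4f}")
```

In this sketch the auxiliary-variable estimator pays for estimating an extra slope parameter, so it only wins when `x` carries information about `y`. Deciding from the data whether that is the case is the kind of model selection problem the abstract addresses.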

research
03/30/2022

A comparison of strategies for selecting auxiliary variables for multiple imputation

Multiple imputation (MI) is a popular method for handling missing data. ...
research
10/23/2021

Prior Intensified Information Criterion

The widely applicable information criterion (WAIC) has been used as a mo...
research
11/10/2015

Incorporating Knowledge into Structural Equation Models using Auxiliary Variables

In this paper, we extend graph-based identification methods by allowing ...
research
02/26/2021

Active Selection of Classification Features

Some data analysis applications comprise datasets, where explanatory var...
research
05/31/2023

Reinforced Borrowing Framework: Leveraging Auxiliary Data for Individualized Inference

Increasingly during the past decade, researchers have sought to leverage...
research
03/12/2021

Orthogonal Statistical Inference for Multimodal Data Analysis

Multimodal imaging has transformed neuroscience research. While it prese...
research
10/31/2020

On the Use of Auxiliary Variables in Multilevel Regression and Poststratification

Multilevel regression and poststratification (MRP) has been a popular ap...
