Bayesian Prediction with Covariates Subject to Detection Limits

01/19/2022
by   Caroline Svahn, et al.
0

Missing values in covariates due to censoring by signal interference or lack of sensitivity in the measuring devices are common in industrial problems. We propose a full Bayesian solution to the prediction problem with an efficient Markov Chain Monte Carlo (MCMC) algorithm that updates all the censored covariate values jointly in a random scan Gibbs sampler. We show that the joint updating of missing covariate values can be at least two orders of magnitude more efficient than univariate updating. This increased efficiency is shown to be crucial for quickly learning the missing covariate values and their uncertainty in a real-time decision making context, in particular when there is substantial correlation in the posterior for the missing values. The approach is evaluated on simulated data and on data from the telecom sector. Our results show that the proposed Bayesian imputation gives substantially more accurate predictions than naïve imputation, and that the use of auxiliary variables in the imputation gives additional predictive power.

READ FULL TEXT
research
02/22/2021

Misguided Use of Observed Covariates to Impute Missing Covariates in Conditional Prediction: A Shrinkage Problem

Researchers regularly perform conditional prediction using imputed value...
research
08/16/2022

Semiparametric imputation using latent sparse conditional Gaussian mixtures for multivariate mixed outcomes

This paper proposes a flexible Bayesian approach to multiple imputation ...
research
12/23/2019

Missing data analysis and imputation via latent Gaussian Markov random fields

In this paper we recast the problem of missing values in the covariates ...
research
02/14/2023

A Projection Approach to Local Regression with Variable-Dimension Covariates

Incomplete covariate vectors are known to be problematic for estimation ...
research
11/14/2018

Analysis of Gaussian Spatial Models with Covariate Measurement Error

Uncertainty is an inherent characteristic of biological and geospatial d...
research
03/03/2021

A Hamiltonian Monte Carlo Model for Imputation and Augmentation of Healthcare Data

Missing values exist in nearly all clinical studies because data for a v...
research
06/22/2022

Sharing pattern submodels for prediction with missing values

Missing values are unavoidable in many applications of machine learning ...

Please sign up or login with your details

Forgot password? Click here to reset