A Projection Approach to Local Regression with Variable-Dimension Covariates

02/14/2023
by   Matthew J. Heiner, et al.
0

Incomplete covariate vectors are known to be problematic for estimation and inferences on model parameters, but their impact on prediction performance is less understood. We develop an imputation-free method that builds on a random partition model admitting variable-dimension covariates. Cluster-specific response models further incorporate covariates via linear predictors, facilitating estimation of smooth prediction surfaces with relatively few clusters. We exploit marginalization techniques of Gaussian kernels to analytically project response distributions according to any pattern of missing covariates, yielding a local regression with internally consistent uncertainty propagation that utilizes only one set of coefficients per cluster. Aggressive shrinkage of these coefficients regulates uncertainty due to missing covariates. The method allows in- and out-of-sample prediction for any missingness pattern, even if the pattern in a new subject's incomplete covariate vector was not seen in the training data. We develop an MCMC algorithm for posterior sampling that improves a computationally expensive update for latent cluster allocation. Finally, we demonstrate the model's effectiveness for nonlinear point and density prediction under various circumstances by comparing with other recent methods for regression of variable dimensions on synthetic and real data.

READ FULL TEXT

page 12

page 27

page 40

research
12/31/2019

Prediction in the Presence of Missing Covariates

In many applied fields incomplete covariate vectors are commonly encount...
research
01/04/2018

Cluster-weighted latent class modeling

Usually in Latent Class Analysis (LCA), external predictors are taken to...
research
01/19/2022

Bayesian Prediction with Covariates Subject to Detection Limits

Missing values in covariates due to censoring by signal interference or ...
research
03/17/2021

Multivariate Cluster Weighted Models Using Skewed Distributions

Much work has been done in the area of the cluster weighted model (CWM),...
research
08/04/2021

Linear regression under model uncertainty

We reexamine the classical linear regression model when the model is sub...
research
10/31/2022

Exact and Approximate Conformal Inference in Multiple Dimensions

It is common in machine learning to estimate a response y given covariat...
research
10/15/2022

Clustering blood donors via mixtures of product partition models with covariates

Motivated by the problem of accurately predicting gap times between succ...

Please sign up or login with your details

Forgot password? Click here to reset