Log In Sign Up

Transporting a prediction model for use in a new target population

by   Jon A. Steingrimsson, et al.

We consider methods for transporting a prediction model and assessing its performance for use in a new target population, when outcome and covariate information for model development is available from a simple random sample from the source population, but only covariate information is available on a simple random sample from the target population. We discuss how to tailor the prediction model for use in the target population, how to assess model performance in the target population (e.g., by estimating the target population mean squared error), and how to perform model and tuning parameter selection in the context of the target population. We provide identifiability results for the target population mean squared error of a potentially misspecified prediction model under a sampling design where the source study and the target population samples are obtained separately. We also introduce the concept of prediction error modifiers that can be used to reason about the need for tailoring measures of model performance to the target population and provide an illustration of the methods using simulated data.


page 1

page 2

page 3

page 4


House Price Prediction Using LSTM

In this paper, we use the house price data ranging from January 2004 to ...

Doubly Robust Inference when Combining Probability and Non-probability Samples with High-dimensional Data

Non-probability samples become increasingly popular in survey statistics...

Convergence Analysis of Stochastic Kriging-Assisted Simulation with Random Covariates

We consider performing simulation experiments in the presence of covaria...

Spatio-Temporal Mixed Models to Predict Coverage Error Rates at Local Areas

Despite of the great efforts during the censuses, occurrence of some non...

Variable selection for transportability

Transportability provides a principled framework to address the problem ...

Inertial Single Vehicle Trajectory Prediction Baselines and Applications with the NGSIM Dataset

In the recent vehicle trajectory prediction literature, the most common ...

scatteR: Generating instance space based on scagnostics

Modern synthetic data generators consist of model-based methods where th...