A blended distance to define "people-like-me"

07/11/2022
by   Anaïs Fopma, et al.
0

Curve matching is a prediction technique that relies on predictive mean matching, which matches donors that are most similar to a target based on the predictive distance. Even though this approach leads to high prediction accuracy, the predictive distance may make matches look unconvincing, as the profiles of the matched donors can substantially differ from the profile of the target. To counterbalance this, similarity between the curves of the donors and the target can be taken into account by combining the predictive distance with the Mahalanobis distance into a `blended distance' measure. The properties of this measure are evaluated in two simulation studies. Simulation study I evaluates the performance of the blended distance under different data-generating conditions. The results show that blending towards the Mahalanobis distance leads to worse performance in terms of bias, coverage, and predictive power. Simulation study II evaluates the blended metric in a setting where a single value is imputed. The results show that a property of blending is the bias-variance trade off. Giving more weight to the Mahalanobis distance leads to less variance in the imputations, but less accuracy as well. The main conclusion is that the high prediction accuracy achieved with the predictive distance necessitates the variability in the profiles of donors.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/03/2017

On the Unreported-Profile-is-Negative Assumption for Predictive Cheminformatics

In cheminformatics, compound-target binding profiles has been a main sou...
research
11/18/2018

MALTS: Matching After Learning to Stretch

We introduce a flexible framework for matching in causal inference that ...
research
11/30/2020

Data Fusion for Joining Income and Consumption Information Using Different Donor-Recipient Distance Metrics

Data fusion describes the method of combining data from (at least) two i...
research
01/24/2023

Think before you shrink: Alternatives to default shrinkage methods can improve prediction accuracy, calibration and coverage

While shrinkage is essential in high-dimensional settings, its use for l...
research
09/03/2020

Probabilistic Forecasting for Daily Electricity Loads and Quantiles for Curve-to-Curve Regression

Probabilistic forecasting of electricity load curves is of fundamental i...
research
08/27/2018

Combining Predictions of Auto Insurance Claims

This paper aims at achieving better performance of prediction by combini...
research
12/12/2019

Efficient Approximation of the Matching Distance for 2-parameter persistence

The matching distance is a computationally tractable topological measure...

Please sign up or login with your details

Forgot password? Click here to reset