Profile Matching for the Generalization and Personalization of Causal Inferences

by   Eric R. Cohn, et al.

We introduce profile matching, a multivariate matching method for randomized experiments and observational studies that finds the largest possible self-weighted samples across multiple treatment groups that are balanced relative to a covariate profile. This covariate profile can represent a specific population or a target individual, facilitating the tasks of generalization and personalization of causal inferences. For generalization, because the profile often amounts to summary statistics for a target population, profile matching does not require accessing individual-level data, which may be unavailable for confidentiality reasons. For personalization, the profile can characterize a single patient. Profile matching achieves covariate balance by construction, but unlike existing approaches to matching, it does not require specifying a matching ratio, as this is implicitly optimized for the data. The method can also be used for the selection of units for study follow-up, and it readily applies to multi-valued treatments with many treatment categories. We evaluate the performance of profile matching in a simulation study of generalization of a randomized trial to a target population. We further illustrate this method in an exploratory observational study of the relationship between opioid use treatment and mental health outcomes. We analyze these relationships for three covariate profiles representing: (i) sexual minorities, (ii) the Appalachian United States, and (iii) a hypothetical vulnerable patient. We provide R code with step-by-step explanations to implement the methods in the paper in the Supplementary Materials.


page 1

page 2

page 3

page 4


Matching Algorithms for Causal Inference with Multiple Treatments

Randomized clinical trials (RCTs) are ideal for estimating causal effect...

One-step weighting to generalize and transport treatment effect estimates to a target population

Weighting methods are often used to generalize and transport estimates o...

Towards an Inferential Lexicon of Event Selecting Predicates for French

We present a manually constructed seed lexicon encoding the inferential ...

Is My Matched Dataset As-If Randomized, More, Or Less? Unifying the Design and Analysis of Observational Studies

Matching alleviates the problem of covariate imbalance in observational ...

Supervised Robust Profile Clustering

In many studies, dimension reduction methods are used to profile partici...

Learning relationships between data obtained independently

The aim of this paper is to provide a new method for learning the relati...