Building Representative Matched Samples with Multi-valued Treatments in Large Observational Studies: Analysis of the Impact of an Earthquake on Educational Attainment

10/15/2018
by   Magdalena Bennett, et al.
0

What is the impact of an earthquake on the educational attainment of high school students? In this paper, we address this question using a unique data set and new matching methods. In particular, we use an administrative census of the same students measured before and after the 2010 Chilean earthquake. We propose and analyze new matching methods that overcome three challenges of existing approaches. These new methods allow us: (i) to handle multi-valued treatments without estimating the generalized propensity score; (ii) to build self-weighted matched samples that are representative of a target population by design; and (iii) to work with much larger data sets than other similar approaches. For this, we use a linear-sized mixed integer programming formulation for matching with distributional covariate balance. We formally show that this formulation is more effective than alternative quadratic-sized formulations, as its reduction in size does not affect its strength from the standpoint of its linear programming relaxation. With this formulation, we can handle data sets with hundreds of thousands of observations in a couple of minutes. Using these methods, we show that while increasing levels of exposure to the earthquake have a negative impact on school attendance, there is no effect on university admission test scores.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2018

Matching Algorithms for Causal Inference with Multiple Treatments

Randomized clinical trials (RCTs) are ideal for estimating causal effect...
research
01/12/2022

Using Cardinality Matching to Design Balanced and Representative Samples for Observational Studies

Cardinality matching is a computational method for finding the largest p...
research
11/06/2020

A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees

Several recent publications report advances in training optimal decision...
research
05/20/2021

Profile Matching for the Generalization and Personalization of Causal Inferences

We introduce profile matching, a multivariate matching method for random...
research
04/23/2019

Integer Programming for Learning Directed Acyclic Graphs from Continuous Data

Learning directed acyclic graphs (DAGs) from data is a challenging task ...
research
07/14/2020

Network Flow Methods for the Minimum Covariates Imbalance Problem

The problem of balancing covariates arises in observational studies wher...
research
10/27/2017

An efficient SAT formulation for learning multiple criteria non-compensatory sorting rules from examples

The literature on Multiple Criteria Decision Analysis (MCDA) proposes se...

Please sign up or login with your details

Forgot password? Click here to reset