Balancing Approach for Causal Inference at Scale

02/10/2023
by   Sicheng Lin, et al.
0

With the modern software and online platforms to collect massive amount of data, there is an increasing demand of applying causal inference methods at large scale when randomized experimentation is not viable. Weighting methods that directly incorporate covariate balancing have recently gained popularity for estimating causal effects in observational studies. These methods reduce the manual efforts required by researchers to iterate between propensity score modeling and balance checking until a satisfied covariate balance result. However, conventional solvers for determining weights lack the scalability to apply such methods on large scale datasets in companies like Snap Inc. To address the limitations and improve computational efficiency, in this paper we present scalable algorithms, DistEB and DistMS, for two balancing approaches: entropy balancing and MicroSynth. The solvers have linear time complexity and can be conveniently implemented in distributed computing frameworks such as Spark, Hive, etc. We study the properties of balancing approaches at different scales up to 1 million treated units and 487 covariates. We find that with larger sample size, both bias and variance in the causal effect estimation are significantly reduced. The results emphasize the importance of applying balancing approaches on large scale datasets. We combine the balancing approach with a synthetic control framework and deploy an end-to-end system for causal impact estimation at Snap Inc.

READ FULL TEXT

page 7

page 8

page 9

research
07/27/2021

End-to-End Balancing for Causal Continuous Treatment-Effect Estimation

We study the problem of observational causal inference with continuous t...
research
02/15/2018

DeepMatch: Balancing Deep Covariate Representations for Causal Inference Using Adversarial Training

We study optimal covariate balance for causal inferences from observatio...
research
03/01/2019

A Framework for Covariate Balance using Bregman Distances

A common goal in observational research is to estimate marginal causal e...
research
03/13/2023

Weighted Euclidean balancing for a matrix exposure in estimating causal effect

In many scientific fields such as biology, psychology and sociology, the...
research
10/23/2020

Counterfactual Representation Learning with Balancing Weights

A key to causal inference with observational data is achieving balance i...
research
08/17/2022

Revisiting the propensity score's central role: Towards bridging balance and efficiency in the era of causal machine learning

About forty years ago, in a now–seminal contribution, Rosenbaum Rubi...

Please sign up or login with your details

Forgot password? Click here to reset