GCF: Generalized Causal Forest for Heterogeneous Treatment Effect Estimation in Online Marketplace

03/21/2022
by   Shu Wan, et al.
0

Uplift modeling is a rapidly growing approach that utilizes machine learning and causal inference methods to estimate the heterogeneous treatment effects. It has been widely adopted and applied to online marketplaces to assist large-scale decision-making in recent years. The existing popular methods, like forest-based modeling, either work only for discrete treatments or make partially linear or parametric assumptions that may suffer from model misspecification. To alleviate these problems, we extend causal forest (CF) with non-parametric dose-response functions (DRFs) that can be estimated locally using a kernel-based doubly robust estimator. Moreover, we propose a distance-based splitting criterion in the functional space of conditional DRFs to capture the heterogeneity for the continuous treatments. We call the proposed algorithm generalized causal forest (GCF) as it generalizes the use case of CF to a much broader setup. We show the effectiveness of GCF by comparing it to popular uplift modeling models on both synthetic and real-world datasets. We implement GCF in Spark and successfully deploy it into DiDi's real-time pricing system. Online A/B testing results further validate the superiority of GCF.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2023

An Efficient Doubly-Robust Test for the Kernel Treatment Effect

The average treatment effect, which is the difference in expectation of ...
research
04/21/2020

Learning Continuous Treatment Policy and Bipartite Embeddings for Matching with Heterogeneous Causal Effects

Causal inference methods are widely applied in the fields of medicine, p...
research
05/23/2017

Uplift Modeling with Multiple Treatments and General Response Types

Randomized experiments have been used to assist decision-making in many ...
research
02/04/2022

Generalized Causal Tree for Uplift Modeling

Uplift modeling is crucial in various applications ranging from marketin...
research
10/05/2021

Non-parametric interpretable score based estimation of heterogeneous treatment effects

In the study of causal inference, statisticians show growing interest in...
research
05/18/2020

Towards Causal Inference for Spatio-Temporal Data: Conflict and Forest Loss in Colombia

In many data scientific problems, we are interested not only in modeling...
research
01/09/2021

Interpretable Multiple Treatment Revenue Uplift Modeling

Big data and business analytics are critical drivers of business and soc...

Please sign up or login with your details

Forgot password? Click here to reset