Combining Offline Causal Inference and Online Bandit Learning for Data Driven Decisions

01/16/2020
by Li Ye, et al.

A fundamental question for companies is how to make good decisions with the increasing amount of logged data. Currently, companies run online tests (e.g., A/B tests) before making decisions. However, online tests can be expensive because testing inferior decisions hurts users' experience. Offline causal inference, on the other hand, analyzes logged data alone to make decisions, but once it makes a wrong decision, that decision will continue to hurt all users' experience. In this paper, we unify offline causal inference and online bandit learning to make the right decision. Our framework is flexible enough to incorporate various causal inference methods (e.g., matching, weighting) and online bandit methods (e.g., UCB, LinUCB). For these novel combinations of algorithms, we derive theoretical bounds on the decision maker's "regret" compared to its optimal decision. We also derive the first regret bound for forest-based online bandit algorithms. Experiments on synthetic data show that our algorithms outperform methods that use only the logged data or only the online feedback.
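To make the idea of combining logged data with online bandit learning concrete, here is a minimal sketch. It is not the paper's algorithm. It assumes the offline step (for example matching or weighting) has already produced a per-arm mean reward estimate and an effective sample count, and it uses those to initialise a standard UCB1 index. The names `warm_start_ucb`, `offline_means`, `offline_counts`, and `pull` are hypothetical and chosen only for illustration.

```python
import math
import random


def warm_start_ucb(offline_means, offline_counts, pull, horizon):
    """Run UCB1 for `horizon` rounds, seeding each arm with offline estimates.

    offline_means[a]  : assumed (already debiased) mean reward of arm a
    offline_counts[a] : assumed effective number of logged samples for arm a
    pull(a)           : hypothetical callback returning the online reward of arm a
    Returns the number of online pulls per arm.
    """
    n_arms = len(offline_means)
    # Treat offline estimates as pseudo-observations.
    counts = list(offline_counts)
    sums = [m * c for m, c in zip(offline_means, offline_counts)]
    online_pulls = [0] * n_arms

    for _ in range(horizon):
        total = sum(counts)

        def index(a):
            # An arm with no data at all (offline or online) is tried first.
            if counts[a] == 0:
                return float("inf")
            bonus = math.sqrt(2.0 * math.log(total + 1) / counts[a])
            return sums[a] / counts[a] + bonus

        arm = max(range(n_arms), key=index)
        reward = pull(arm)
        counts[arm] += 1
        sums[arm] += reward
        online_pulls[arm] += 1

    return online_pulls


if __name__ == "__main__":
    # Illustrative Bernoulli arms; the true means are made up for the demo.
    rng = random.Random(0)
    true_means = [0.2, 0.8]
    pulls = warm_start_ucb(
        offline_means=[0.25, 0.75],
        offline_counts=[50, 50],
        pull=lambda a: 1.0 if rng.random() < true_means[a] else 0.0,
        horizon=200,
    )
    print(pulls)
```

With informative offline counts, the confidence bonus of a clearly inferior arm starts small, so the learner spends few or no online rounds on it. With all counts set to zero, the sketch reduces to plain UCB1, which corresponds to using only online feedback.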


research
06/10/2016

Causal Bandits: Learning Good Interventions via Causal Inference

We study the problem of using causal models to improve the rate at which...
research
08/18/2023

Active and Passive Causal Inference Learning

This paper serves as a starting point for machine learning researchers, ...
research
07/15/2023

Unveiling Bias in Sequential Decision Making: A Causal Inference Approach for Stochastic Service Systems

In many stochastic service systems, decision-makers find themselves maki...
research
10/16/2012

A Bayesian Approach to Constraint Based Causal Inference

We target the problem of accuracy and robustness in causal inference fro...
research
03/14/2022

Introducing causal inference in the energy-efficient building design process

"What-if" questions are intuitively generated and commonly asked during ...
research
02/18/2021

Online Learning via Offline Greedy Algorithms: Applications in Market Design and Optimization

Motivated by online decision-making in time-varying combinatorial enviro...
research
11/26/2021

Online Causal Inference with Application to Near Real-Time Post-Market Vaccine Safety Surveillance

Streaming data routinely generated by mobile phones, social networks, e-...
