Off-policy evaluation (OPE) is a method for estimating the return of a t...
Off-Policy evaluation (OPE) is concerned with evaluating a new target po...
Off-policy evaluation is critical in a number of applications where new
...
We consider reinforcement learning (RL) methods in offline domains witho...
Policy evaluation based on A/B testing has attracted considerable intere...
This paper is concerned with constructing a confidence interval for a ta...
The two-sided markets such as ride-sharing companies often involve a gro...
Tech companies (e.g., Google or Facebook) often use randomized online
ex...
Order dispatch is one of the central problems to ride-sharing platforms....
A/B testing, or online experiment is a standard business strategy to com...
We propose graphical sure screening, or GRASS, a very simple and
computa...