Discussion of Kallus (2020) and Mo, Qi, and Liu (2020): New Objectives for Policy Learning

10/09/2020
by   Sijia Li, et al.
0

We discuss the thought-provoking new objective functions for policy learning that were proposed in "More efficient policy learning via optimal retargeting" by Nathan Kallus and "Learning optimal distributionally robust individualized treatment rules" by Weibin Mo, Zhengling Qi, and Yufeng Liu. We show that it is important to take the curvature of the value function into account when working within the retargeting framework, and we introduce two ways to do so. We also describe more efficient approaches for leveraging calibration data when learning distributionally robust policies.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset