Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

09/22/2021
by   Eli Ben-Michael, et al.
0

Algorithmic recommendations and decisions have become ubiquitous in today's society. Many of these and other data-driven policies are based on known, deterministic rules to ensure their transparency and interpretability. This is especially true when such policies are used for public policy decision-making. For example, algorithmic pre-trial risk assessments, which serve as our motivating application, provide relatively simple, deterministic classification scores and recommendations to help judges make release decisions. Unfortunately, existing methods for policy learning are not applicable because they require existing policies to be stochastic rather than deterministic. We develop a robust optimization approach that partially identifies the expected utility of a policy, and then finds an optimal policy by minimizing the worst-case regret. The resulting policy is conservative but has a statistical safety guarantee, allowing the policy-maker to limit the probability of producing a worse outcome than the existing policy. We extend this approach to common and important settings where humans make decisions with the aid of algorithmic recommendations. Lastly, we apply the proposed methodology to a unique field experiment on pre-trial risk assessments. We derive new classification and recommendation rules that retain the transparency and interpretability of the existing risk assessment instrument while potentially leading to better overall outcomes at a lower cost.

READ FULL TEXT

page 35

page 37

research
12/04/2020

Experimental Evaluation of Algorithm-Assisted Human Decision-Making: Application to Pretrial Public Safety Assessment

Despite an increasing reliance on fully-automated algorithmic decision m...
research
07/17/2023

Bayesian Safe Policy Learning with Chance Constrained Optimization: Application to Military Security Assessment during the Vietnam War

Algorithmic and data-driven decisions and recommendations are commonly u...
research
01/23/2020

The impact of overbooking on a pre-trial risk assessment tool

Pre-trial risk assessment tools are used to make recommendations to judg...
research
03/17/2023

Policy/mechanism separation in the Warehouse-Scale OS

"As many of us know from bitter experience, the policies provided in ext...
research
06/21/2022

Policy learning with asymmetric utilities

Data-driven decision making plays an important role even in high stakes ...
research
09/05/2022

The Best Decisions Are Not the Best Advice: Making Adherence-Aware Recommendations

Many high-stake decisions follow an expert-in-loop structure in that a h...
research
08/29/2022

Safe Policy Learning under Regression Discontinuity Designs

The regression discontinuity (RD) design is widely used for program eval...

Please sign up or login with your details

Forgot password? Click here to reset