Did You Mean...? Confidence-based Trade-offs in Semantic Parsing

03/29/2023
by   Elias Stengel-Eskin, et al.

We illustrate how a calibrated model can help balance common trade-offs in task-oriented parsing. In a simulated annotator-in-the-loop experiment, we show that well-calibrated confidence scores allow us to balance cost with annotator load, improving accuracy with a small number of interactions. We then examine how confidence scores can help optimize the trade-off between usability and safety. We show that confidence-based thresholding can substantially reduce the number of incorrect low-confidence programs executed; however, this comes at a cost to usability. We propose the DidYouMean system, which better balances usability and safety.
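
The thresholding trade-off described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Parse` record, the `decide` and `summarize` helpers, the program names, and the toy confidence values are hypothetical and are not taken from the paper. The sketch executes a predicted program only when its confidence meets a threshold, then counts how many incorrect programs still get executed (a rough proxy for safety) and what fraction of requests are executed at all (a rough proxy for usability).

```python
from dataclasses import dataclass


@dataclass
class Parse:
    program: str       # predicted program (hypothetical toy label)
    confidence: float  # model confidence in [0, 1], assumed to be calibrated
    correct: bool      # gold correctness, used only to score the simulation


def decide(parse: Parse, threshold: float) -> str:
    """Execute the program only if its confidence meets the threshold."""
    return "execute" if parse.confidence >= threshold else "reject"


def summarize(parses: list[Parse], threshold: float) -> tuple[int, float]:
    """Return (incorrect programs executed, fraction of programs executed)."""
    executed = [p for p in parses if decide(p, threshold) == "execute"]
    incorrect_executed = sum(not p.correct for p in executed)
    coverage = len(executed) / len(parses)
    return incorrect_executed, coverage


# Toy predictions; the values are invented for illustration only.
PARSES = [
    Parse("create_event", 0.95, True),
    Parse("find_event", 0.80, True),
    Parse("delete_event", 0.60, False),
    Parse("update_event", 0.40, True),
    Parse("cancel_event", 0.20, False),
]

if __name__ == "__main__":
    for threshold in (0.0, 0.5, 0.7):
        wrong, coverage = summarize(PARSES, threshold)
        print(f"threshold={threshold}: incorrect executed={wrong}, coverage={coverage}")
```

On this toy data, raising the threshold lowers the number of incorrect programs executed, but it also rejects a correct low-confidence program, which is the usability cost the abstract refers to. The abstract does not say how DidYouMean handles the rejected cases, so the sketch stops at a plain execute-or-reject rule.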
