Did You Mean...? Confidence-based Trade-offs in Semantic Parsing

03/29/2023
by   Elias Stengel-Eskin, et al.
0

We illustrate how a calibrated model can help balance common trade-offs in task-oriented parsing. In a simulated annotator-in-the-loop experiment, we show that well-calibrated confidence scores allow us to balance cost with annotator load, improving accuracy with a small number of interactions. We then examine how confidence scores can help optimize the trade-off between usability and safety. We show that confidence-based thresholding can substantially reduce the number of incorrect low-confidence programs executed; however, this comes at a cost to usability. We propose the DidYouMean system which better balances usability and safety.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/14/2022

Calibrated Interpretation: Confidence Estimation in Semantic Parsing

Task-oriented semantic parsing is increasingly being used in user-facing...
research
10/07/2019

Designing Interfaces to Help Stakeholders Comprehend, Navigate, and Manage Algorithmic Trade-Offs

Artificial intelligence algorithms have been applied to a wide variety o...
research
01/02/2021

Empirical Decision Rules for Improving the Uncertainty Reporting of Small Sample System Usability Scale Scores

The System Usability Scale (SUS) is a short, survey-based approach used ...
research
01/17/2023

Which Model Shall I Choose? Cost/Quality Trade-offs for Text Classification Tasks

Industry practitioners always face the problem of choosing the appropria...
research
03/04/2019

An Adversarial Super-Resolution Remedy for Radar Design Trade-offs

Radar is of vital importance in many fields, such as autonomous driving,...
research
05/12/2022

On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data

Borrowing ideas from Production functions in micro-economics, in this pa...
research
05/18/2021

Learning to Act Safely with Limited Exposure and Almost Sure Certainty

This paper aims to put forward the concept that learning to take safe ac...

Please sign up or login with your details

Forgot password? Click here to reset