Conformal Prediction with Large Language Models for Multi-Choice Question Answering

05/28/2023
by Bhawesh Kumar, et al.

As large language models continue to be widely developed, robust uncertainty quantification techniques will become crucial for their safe deployment in high-stakes scenarios. In this work, we explore how conformal prediction can be used to provide uncertainty quantification in language models for the specific task of multiple-choice question answering. We find that the uncertainty estimates from conformal prediction are tightly correlated with prediction accuracy. This observation can be useful for downstream applications such as selective classification and filtering out low-quality predictions. We also investigate how the exchangeability assumption required by conformal prediction fares on out-of-subject questions, which may be a more realistic scenario for many practical applications. Our work contributes towards more trustworthy and reliable usage of large language models in safety-critical situations, where robust guarantees on the error rate are required.
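To make the idea concrete, here is a minimal sketch of split conformal prediction applied to multiple-choice answers. It is an illustration under stated assumptions, not necessarily the procedure used in the paper: it assumes the model's per-option probabilities are already available as an array, it uses one common conformal score (one minus the probability of the correct option), and the names `conformal_threshold` and `prediction_sets` are hypothetical.

```python
import math
import numpy as np

def conformal_threshold(cal_probs, cal_labels, alpha=0.1):
    """Calibrate a score threshold on held-out questions (assumed exchangeable with test questions)."""
    n = len(cal_labels)
    # Conformal score: 1 - probability the model assigned to the correct option.
    scores = np.sort(1.0 - cal_probs[np.arange(n), cal_labels])
    # Finite-sample corrected rank: ceil((n + 1) * (1 - alpha)).
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: include every option.
        return float("inf")
    return float(scores[k - 1])

def prediction_sets(test_probs, q_hat):
    """Return, per question, the indices of all options whose score is within the threshold."""
    return [np.flatnonzero(1.0 - p <= q_hat).tolist() for p in test_probs]

# Toy calibration data: 4 questions, 4 options each (illustrative numbers only).
cal_probs = np.array([
    [0.90, 0.05, 0.03, 0.02],
    [0.20, 0.60, 0.10, 0.10],
    [0.10, 0.10, 0.70, 0.10],
    [0.50, 0.20, 0.10, 0.20],
])
cal_labels = np.array([0, 1, 2, 3])
q_hat = conformal_threshold(cal_probs, cal_labels, alpha=0.5)

test_probs = np.array([
    [0.70, 0.10, 0.10, 0.10],
    [0.45, 0.40, 0.10, 0.05],
])
sets = prediction_sets(test_probs, 0.65)
```

Under exchangeability, sets built this way contain the correct option with probability at least 1 - alpha. The set size can then serve as the uncertainty signal: a question with a large set can be filtered out or deferred, which is the selective-classification use the abstract mentions.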


research
07/18/2023

PAC Neural Prediction Set Learning to Quantify the Uncertainty of Generative Language Models

Uncertainty learning and quantification of models are crucial tasks to e...
research
08/06/2023

Building Safe and Reliable AI systems for Safety Critical Tasks with Vision-Language Processing

Although AI systems have been applied in various fields and achieved imp...
research
12/09/2022

Reliable Multimodal Trajectory Prediction via Error Aligned Uncertainty Optimization

Reliable uncertainty quantification in deep neural networks is very cruc...
research
08/07/2023

Trusting Language Models in Education

Language Models are being widely used in Education. Even though modern d...
research
05/24/2023

Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4

Misinformation poses a critical societal challenge, and current approach...
research
09/21/2023

Knowledge Sanitization of Large Language Models

We explore a knowledge sanitization approach to mitigate the privacy con...
research
07/29/2021

Quantifying Uncertainty for Machine Learning Based Diagnostic

Virtual Diagnostic (VD) is a deep learning tool that can be used to pred...
