Realistic Conversational Question Answering with Answer Selection based on Calibrated Confidence and Uncertainty Measurement

02/10/2023
by   Soyeong Jeong, et al.
0

Conversational Question Answering (ConvQA) models aim at answering a question with its relevant paragraph and previous question-answer pairs that occurred during conversation multiple times. To apply such models to a real-world scenario, some existing work uses predicted answers, instead of unavailable ground-truth answers, as the conversation history for inference. However, since these models usually predict wrong answers, using all the predictions without filtering significantly hampers the model performance. To address this problem, we propose to filter out inaccurate answers in the conversation history based on their estimated confidences and uncertainties from the ConvQA model, without making any architectural changes. Moreover, to make the confidence and uncertainty values more reliable, we propose to further calibrate them, thereby smoothing the model predictions. We validate our models, Answer Selection-based realistic Conversation Question Answering, on two standard ConvQA datasets, and the results show that our models significantly outperform relevant baselines. Code is available at: https://github.com/starsuzi/AS-ConvQA.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/12/2020

Do not let the history haunt you – Mitigating Compounding Errors in Conversational Question Answering

The Conversational Question Answering (CoQA) task involves answering a s...
research
06/07/2023

Phrase Retrieval for Open-Domain Conversational Question Answering with Conversational Dependency Modeling via Contrastive Learning

Open-Domain Conversational Question Answering (ODConvQA) aims at answeri...
research
10/02/2021

TopiOCQA: Open-domain Conversational Question Answeringwith Topic Switching

In a conversational question answering scenario, a questioner seeks to e...
research
03/13/2021

ParaQA: A Question Answering Dataset with Paraphrase Responses for Single-Turn Conversation

This paper presents ParaQA, a question answering (QA) dataset with multi...
research
08/18/2023

Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models

Video Question Answering (VideoQA) is a challenging task that entails co...
research
06/29/2022

What Can Secondary Predictions Tell Us? An Exploration on Question-Answering with SQuAD-v2.0

Performance in natural language processing, and specifically for the que...

Please sign up or login with your details

Forgot password? Click here to reset