Multi-Type Conversational Question-Answer Generation with Closed-ended and Unanswerable Questions

10/24/2022
by   Seonjeong Hwang, et al.
0

Conversational question answering (CQA) facilitates an incremental and interactive understanding of a given context, but building a CQA system is difficult for many domains due to the problem of data scarcity. In this paper, we introduce a novel method to synthesize data for CQA with various question types, including open-ended, closed-ended, and unanswerable questions. We design a different generation flow for each question type and effectively combine them in a single, shared framework. Moreover, we devise a hierarchical answerability classification (hierarchical AC) module that improves quality of the synthetic data while acquiring unanswerable questions. Manual inspections show that synthetic data generated with our framework have characteristics very similar to those of human-generated conversations. Across four domains, CQA systems trained on our synthetic data indeed show good performance close to the systems trained on human-annotated data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2022

Conversational QA Dataset Generation with Answer Revision

Conversational question–answer generation is a task that automatically g...
research
02/04/2021

ChainCQG: Flow-Aware Conversational Question Generation

Conversational systems enable numerous valuable applications, and questi...
research
09/10/2020

Sanitizing Synthetic Training Data Generation for Question Answering over Knowledge Graphs

Synthetic data generation is important to training and evaluating neural...
research
10/19/2020

Understanding Unnatural Questions Improves Reasoning over Text

Complex question answering (CQA) over raw text is a challenging task. A ...
research
05/17/2022

"What makes a question inquisitive?" A Study on Type-Controlled Inquisitive Question Generation

We propose a type-controlled framework for inquisitive question generati...
research
05/13/2018

Learning to Ask Questions in Open-domain Conversational Systems with Typed Decoders

Asking good questions in large-scale, open-domain conversational systems...
research
06/15/2019

Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation

An end-to-end data integration system requires human feedback in several...

Please sign up or login with your details

Forgot password? Click here to reset