DIFFQG: Generating Questions to Summarize Factual Changes

03/01/2023
by Jeremy R. Cole, et al.

Identifying the differences between two versions of the same article is useful for updating knowledge bases and for understanding how articles evolve. Paired texts occur naturally in diverse situations: reporters write similar news stories, and maintainers of authoritative websites must keep their information up to date. We propose representing factual changes between paired documents as question-answer pairs, where the answer to the same question differs between the two versions. We find that question-answer pairs can flexibly and concisely capture the updated contents. Provided with paired documents, annotators identify questions that are answered by one passage but are answered differently, or cannot be answered, by the other. We release DIFFQG, which consists of 759 QA pairs and 1153 examples of paired passages with no factual change. These questions are intended to be both unambiguous and information-seeking, and they involve complex edits, pushing beyond the capabilities of current question generation and factual change detection systems. Our dataset summarizes the changes between two versions of a document as questions and answers, studying automatic update summarization in a novel way.
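
The representation described in the abstract can be made concrete with a small sketch. The class name `QAChange`, its field names, the `summarize` helper, and the sample questions are all assumptions made for illustration; they are not the released DIFFQG schema or data. The sketch only shows the idea that a factual change is a question whose answer differs between two versions, including the case where one version cannot answer it.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class QAChange:
    """One question asked against two versions of a passage.

    Hypothetical schema for illustration; not the released DIFFQG format.
    """
    question: str
    answer_before: Optional[str]  # None: the older passage cannot answer it
    answer_after: Optional[str]   # None: the newer passage cannot answer it

    def is_factual_change(self) -> bool:
        # A change exists when the two versions disagree, including the
        # case where only one version can answer the question.
        return self.answer_before != self.answer_after


def summarize(changes: List[QAChange]) -> List[str]:
    """Render each real change as a one-line 'question: old -> new' summary."""
    lines = []
    for c in changes:
        if c.is_factual_change():
            old = c.answer_before or "(unanswerable)"
            new = c.answer_after or "(unanswerable)"
            lines.append(f"{c.question}: {old} -> {new}")
    return lines


# Invented sample data, not drawn from the dataset.
example = [
    QAChange("Who is the club's head coach?", "A. Smith", "B. Jones"),
    QAChange("When was the stadium opened?", None, "1998"),
    QAChange("In which city is the club based?", "Leeds", "Leeds"),
]

for line in summarize(example):
    print(line)
```

The third record has identical answers in both versions, so it is treated as a pair with no factual change and is left out of the summary.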

research
09/10/2021

Generating Self-Contained and Summary-Centric Question Answer Pairs via Differentiable Reward Imitation Learning

Motivated by suggested question generation in conversational news recomm...
research
05/27/2022

V-Doc : Visual questions answers with Documents

We propose V-Doc, a question-answering tool using document images and PD...
research
10/22/2020

Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval

Recent progress in pretrained language model "solved" many reading compr...
research
06/06/2022

Learning to Ask Like a Physician

Existing question answering (QA) datasets derived from electronic health...
research
07/18/2023

How is ChatGPT's behavior changing over time?

GPT-3.5 and GPT-4 are the two most widely used large language model (LLM...
research
04/27/2023

ChatLog: Recording and Analyzing ChatGPT Across Time

While there are abundant researches about evaluating ChatGPT on natural ...
research
03/08/2023

Class Cardinality Comparison as a Fermi Problem

Questions on class cardinality comparisons are quite tricky to answer an...
