Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering

05/24/2023
by   Wang Zhu, et al.
0

We train a language model (LM) to robustly answer multistep questions by generating and answering sub-questions. We propose Chain-of-Questions, a framework that trains a model to generate sub-questions and sub-answers one at a time by leveraging human annotated question decomposition meaning representation (QDMR). The key technical challenge is that QDMR only contains sub-questions but not answers to those sub-questions, so we treat sub-answers as latent variables and optimize them using a novel dynamic mixture of Hard-EM and MAPO. Chain-of-Questions greatly outperforms strong neuro-symbolic methods by 9.0 F1 on DROP contrast set, and outperforms GPT-3.5 by 24.3 F1 on HOTPOTQA adversarial set, thus demonstrating the effectiveness and robustness of our framework.

READ FULL TEXT
research
10/30/2019

Ensembling Strategies for Answering Natural Questions

Many of the top question answering systems today utilize ensembling to i...
research
04/04/2019

Answer-based Adversarial Training for Generating Clarification Questions

We present an approach for generating clarification questions with the g...
research
02/22/2020

Training Question Answering Models From Synthetic Data

Question and answer generation is a data augmentation method that aims t...
research
09/01/2020

Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models

A common approach to solve complex tasks is by breaking them down into s...
research
10/07/2022

Generating Quizzes to Support Training on Quality Management and Assurance in Space Science and Engineering

Quality management and assurance is key for space agencies to guarantee ...
research
10/04/2020

When in Doubt, Ask: Generating Answerable and Unanswerable Questions, Unsupervised

Question Answering (QA) is key for making possible a robust communicatio...
research
09/08/2021

TruthfulQA: Measuring How Models Mimic Human Falsehoods

We propose a benchmark to measure whether a language model is truthful i...

Please sign up or login with your details

Forgot password? Click here to reset