MK-SQuIT: Synthesizing Questions using Iterative Template-filling

11/04/2020
by   Benjamin A. Spiegel, et al.
0

The aim of this work is to create a framework for synthetically generating question/query pairs with as little human input as possible. These datasets can be used to train machine translation systems to convert natural language questions into queries, a useful tool that could allow for more natural access to database information. Existing methods of dataset generation require human input that scales linearly with the size of the dataset, resulting in small datasets. Aside from a short initial configuration task, no human input is required during the query generation process of our system. We leverage WikiData, a knowledge base of RDF triples, as a source for generating the main content of questions and queries. Using multiple layers of question templating we are able to sidestep some of the most challenging parts of query generation that have been handled by humans in previous methods; humans never have to modify, aggregate, inspect, annotate, or generate any questions or queries at any step in the process. Our system is easily configurable to multiple domains and can be modified to generate queries in natural languages other than English. We also present an example dataset of 110,000 question/query pairs across four WikiData domains. We then present a baseline model that we train using the dataset which shows promise in a commercial QA setting.

READ FULL TEXT
research
10/12/2016

Question Generation from a Knowledge Base with Web Exploration

Question generation from a knowledge base (KB) is the task of generating...
research
09/08/2021

Formal Query Building with Query Structure Prediction for Complex Question Answering over Knowledge Base

Formal query building is an important part of complex question answering...
research
04/21/2020

Knowledge-Driven Distractor Generation for Cloze-style Multiple Choice Questions

In this paper, we propose a novel configurable framework to automaticall...
research
07/25/2016

An Evolutionary Algorithm to Learn SPARQL Queries for Source-Target-Pairs: Finding Patterns for Human Associations in DBpedia

Efficient usage of the knowledge provided by the Linked Data community i...
research
12/15/2020

Generation of complex database queries and API calls from natural language utterances

Generating queries corresponding to natural language questions is a long...
research
10/25/2019

Generating a Common Question from Multiple Documents using Multi-source Encoder-Decoder Models

Ambiguous user queries in search engines result in the retrieval of docu...
research
01/07/2021

Applying Transfer Learning for Improving Domain-Specific Search Experience Using Query to Question Similarity

Search is one of the most common platforms used to seek information. How...

Please sign up or login with your details

Forgot password? Click here to reset