Open Information Extraction from Question-Answer Pairs

by   Nikita Bhutani, et al.

Open Information Extraction (OpenIE) extracts meaningful structured tuples from free-form text. Most previous work on OpenIE considers extracting data from one sentence at a time. We describe NeurON, a system for extracting tuples from question-answer pairs. Since real questions and answers often contain precisely the information that users care about, such information is particularly desirable to extend a knowledge base with. NeurON addresses several challenges. First, an answer text is often hard to understand without knowing the question, and second, relevant information can span multiple sentences. To address these, NeurON formulates extraction as a multi-source sequence-to-sequence learning task, wherein it combines distributed representations of a question and an answer to generate knowledge facts. We describe experiments on two real-world datasets that demonstrate that NeurON can find a significant number of new and interesting facts to extend a knowledge base compared to state-of-the-art OpenIE methods.


page 1

page 2

page 3

page 4


Neural Generative Question Answering

This paper presents an end-to-end neural network model, named Neural Gen...

Tag and Correct: Question aware Open Information Extraction with Two-stage Decoding

Question Aware Open Information Extraction (Question aware Open IE) take...

Multi-Task Learning with Multi-View Attention for Answer Selection and Knowledge Base Question Answering

Answer selection and knowledge base question answering (KBQA) are two im...

Asking Questions Like Educational Experts: Automatically Generating Question-Answer Pairs on Real-World Examination Data

Generating high quality question-answer pairs is a hard but meaningful t...

A Production Oriented Approach for Vandalism Detection in Wikidata - The Buffaloberry Vandalism Detector at WSDM Cup 2017

Wikidata is a free and open knowledge base from the Wikimedia Foundation...

KGCleaner : Identifying and Correcting Errors Produced by Information Extraction Systems

KGCleaner is a framework to identify and correct errors in data produced...

Part Whole Extraction: Towards A Deep Understanding of Quantitative Facts for Percentages in Text

We study the problem of quantitative facts extraction for text with perc...