Constructing a Natural Language Inference Dataset using Generative Neural Networks

07/20/2016
by   Janez Starc, et al.
0

Natural Language Inference is an important task for Natural Language Understanding. It is concerned with classifying the logical relation between two sentences. In this paper, we propose several text generative neural networks for generating text hypothesis, which allows construction of new Natural Language Inference datasets. To evaluate the models, we propose a new metric -- the accuracy of the classifier trained on the generated dataset. The accuracy obtained by our best generative model is only 2.7 accuracy of the classifier trained on the original, human crafted dataset. Furthermore, the best generated dataset combined with the original dataset achieves the highest accuracy. The best model learns a mapping embedding for each training example. By comparing various metrics we show that datasets that obtain higher ROUGE or METEOR scores do not necessarily yield higher classification accuracies. We also provide analysis of what are the characteristics of a good dataset including the distinguishability of the generated datasets from the original one.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2021

A Puzzle-Based Dataset for Natural Language Inference

We provide here a dataset for tasks related to natural language understa...
research
03/13/2022

SciNLI: A Corpus for Natural Language Inference on Scientific Text

Existing Natural Language Inference (NLI) datasets, while being instrume...
research
03/17/2022

RoMe: A Robust Metric for Evaluating Natural Language Generation

Evaluating Natural Language Generation (NLG) systems is a challenging ta...
research
07/02/2022

FRAME: Evaluating Simulatability Metrics for Free-Text Rationales

Free-text rationales aim to explain neural language model (LM) behavior ...
research
10/30/2022

Validity Assessment of Legal Will Statements as Natural Language Inference

This work introduces a natural language inference (NLI) dataset that foc...
research
05/26/2019

TIGS: An Inference Algorithm for Text Infilling with Gradient Search

Text infilling is defined as a task for filling in the missing part of a...
research
04/09/2021

Text2Chart: A Multi-Staged Chart Generator from Natural Language Text

Generation of scientific visualization from analytical natural language ...

Please sign up or login with your details

Forgot password? Click here to reset