ANLIzing the Adversarial Natural Language Inference Dataset

10/24/2020
by   Adina Williams, et al.
0

We perform an in-depth error analysis of Adversarial NLI (ANLI), a recently introduced large-scale human-and-model-in-the-loop natural language inference dataset collected over multiple rounds. We propose a fine-grained annotation scheme of the different aspects of inference that are responsible for the gold classification labels, and use it to hand-code all three of the ANLI development sets. We use these annotations to answer a variety of interesting questions: which inference types are most common, which models have the highest performance on each reasoning type, and which types are the most challenging for state of-the-art models? We hope that our annotations will enable more fine-grained evaluation of models trained on ANLI, provide us with a deeper understanding of where models fail and succeed, and help us determine how to train better models in future.

READ FULL TEXT
research
12/07/2019

Adversarial Analysis of Natural Language Inference Systems

The release of large natural language inference (NLI) datasets like SNLI...
research
12/31/2020

Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection

We present a first-of-its-kind large synthetic training dataset for onli...
research
10/20/2020

Natural Language Inference with Mixed Effects

There is growing evidence that the prevalence of disagreement in the raw...
research
06/18/2019

Hyperintensional Reasoning based on Natural Language Knowledge Base

The success of automated reasoning techniques over large natural-languag...
research
06/15/2022

Towards ML Methods for Biodiversity: A Novel Wild Bee Dataset and Evaluations of XAI Methods for ML-Assisted Rare Species Annotations

Insects are a crucial part of our ecosystem. Sadly, in the past few deca...
research
09/16/2021

Does External Knowledge Help Explainable Natural Language Inference? Automatic Evaluation vs. Human Ratings

Natural language inference (NLI) requires models to learn and apply comm...
research
10/31/2019

Adversarial NLI: A New Benchmark for Natural Language Understanding

We introduce a new large-scale NLI benchmark dataset, collected via an i...

Please sign up or login with your details

Forgot password? Click here to reset