Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models

05/29/2020
by Viktor Schlegel, et al.

Recent years have seen a growing number of publications that analyse Natural Language Inference (NLI) datasets for superficial cues, examine whether such cues undermine the complexity of the tasks underlying those datasets, and study how they affect models that are optimised and evaluated on this data. This structured survey provides an overview of this evolving research area by categorising reported weaknesses in models and datasets, together with the methods proposed to reveal and alleviate those weaknesses, for the English language. We summarise and discuss the findings and conclude with a set of recommendations for possible future research directions. We hope it will be a useful resource both for researchers who propose new datasets, giving them a set of tools to assess the suitability and quality of their data for evaluating various phenomena of interest, and for those who develop novel architectures, helping them to further understand the implications of their improvements with respect to their model's acquired capabilities.
