Which Spurious Correlations Impact Reasoning in NLI Models? A Visual Interactive Diagnosis through Data-Constrained Counterfactuals

06/21/2023
by Robin Chan, et al.

We present a human-in-the-loop dashboard tailored to diagnosing potential spurious features that natural language inference (NLI) models rely on for predictions. The dashboard enables users to generate diverse and challenging examples by drawing inspiration from GPT-3 suggestions. Additionally, users can receive feedback from a trained NLI model on how challenging the newly created example is and make refinements based on the feedback. Through our investigation, we discover several spurious correlations that impact the reasoning of NLI models, which we group into three categories: Semantic Relevance, Logical Fallacies, and Bias. Based on our findings, we identify and describe various research opportunities, including diversifying training data and assessing NLI models' robustness by creating adversarial test suites.

