Shortcut Learning of Large Language Models in Natural Language Understanding: A Survey

08/25/2022
by   Mengnan Du, et al.
0

Large language models (LLMs) have achieved state-of-the-art performance on a series of natural language understanding tasks. However, these LLMs might rely on dataset bias and artifacts as shortcuts for prediction. This has significantly hurt their Out-of-Distribution (OOD) generalization and adversarial robustness. In this paper, we provide a review of recent developments that address the robustness challenge of LLMs. We first introduce the concepts and robustness challenge of LLMs. We then introduce methods to identify shortcut learning behavior in LLMs, characterize the reasons for shortcut learning, as well as introduce mitigation solutions. Finally, we identify key challenges and introduce the connections of this line of research to other directions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/15/2020

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

When scaled to hundreds of billions of parameters, pretrained language m...
research
09/09/2021

Debiasing Methods in Natural Language Understanding Make Bias More Accessible

Model robustness to bias is often determined by the generalization on ca...
research
06/16/2022

Methods for Estimating and Improving Robustness of Language Models

Despite their outstanding performance, large language models (LLMs) suff...
research
05/24/2022

FLUTE: Figurative Language Understanding and Textual Explanations

In spite of the prevalence of figurative language, transformer-based mod...
research
07/19/2022

Analyzing Bagging Methods for Language Models

Modern language models leverage increasingly large numbers of parameters...
research
05/30/2022

Parameter Efficient Diff Pruning for Bias Mitigation

In recent years language models have achieved state of the art performan...
research
08/23/2023

Diagnosing Infeasible Optimization Problems Using Large Language Models

Decision-making problems can be represented as mathematical optimization...

Please sign up or login with your details

Forgot password? Click here to reset