Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions

07/28/2022
by Yanai Elazar, et al.

Large amounts of training data are one of the major reasons for the high performance of state-of-the-art NLP models. But what exactly in the training data causes a model to make a certain prediction? We seek to answer this question by providing a language for describing how training data influences predictions, through a causal framework. Importantly, our framework bypasses the need to retrain expensive models and allows us to estimate causal effects based on observational data alone. Addressing the problem of extracting factual knowledge from pretrained language models (PLMs), we focus on simple data statistics such as co-occurrence counts and show that these statistics do influence the predictions of PLMs, suggesting that such models rely on shallow heuristics. Our causal framework and our results demonstrate the importance of studying datasets and the benefits of causality for understanding NLP models.

research
03/05/2014

Estimating complex causal effects from incomplete observational data

Despite the major advances taken in causal modeling, causality is still ...
research
06/14/2019

Scalable Syntax-Aware Language Models Using Knowledge Distillation

Prior work has shown that, on small amounts of training data, syntactic ...
research
08/24/2023

Causal Parrots: Large Language Models May Talk Causality But Are Not Causal

Some argue scale is all what is needed to achieve AI, covering even caus...
research
11/08/2022

SocioProbe: What, When, and Where Language Models Learn about Sociodemographics

Pre-trained language models (PLMs) have outperformed other NLP models on...
research
08/07/2023

A Cross-Domain Evaluation of Approaches for Causal Knowledge Extraction

Causal knowledge extraction is the task of extracting relevant causes an...
research
01/12/2020

Towards causality-aware predictions in static machine learning tasks: the linear structural causal model case

While counterfactual thinking has been used in ML tasks that aim to pred...
research
03/28/2018

Supervising Feature Influence

Causal influence measures for machine learnt classifiers shed light on t...
