DISCO: Distilling Phrasal Counterfactuals with Large Language Models

12/20/2022
by Zeming Chen, et al.

Recent methods demonstrate that data augmentation using counterfactual knowledge can teach models the causal structure of a task, leading to robust and generalizable models. However, such counterfactual data is often limited in scale and diversity when crowdsourced, and is computationally expensive to extend to new perturbation types when generated using supervised methods. To address this, we introduce a new framework called DISCO for automatically generating high-quality counterfactual data at scale. DISCO engineers prompts to generate phrasal perturbations with a large general language model. Then, a task-specific teacher model filters the generations to distill high-quality counterfactual data. We show that learning with this counterfactual data yields a comparatively small student model that is 6% (absolute) more robust and generalizes 5% better across distributions than baselines on various challenging evaluations. This model is also 15% more sensitive in differentiating original and counterfactual examples, on three evaluation sets written by human workers and via human-AI collaboration.
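
The generate-then-filter idea described above can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the task format, the prompts, or the filtering criterion, so everything here is an assumption made for illustration. The sketch assumes an NLI-style example (premise, hypothesis, label), a hypothetical `generate` callable standing in for the large language model, a hypothetical `teacher` callable returning label probabilities, and a filter that keeps a perturbation only if the teacher confidently assigns it a label different from the original. The toy stubs exist only to make the example runnable.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Example:
    premise: str
    hypothesis: str
    label: str


def build_prompt(example: Example, span: str) -> str:
    # Hypothetical prompt format: mask the span and ask for a replacement.
    masked = example.premise.replace(span, "[blank]", 1)
    return f"Premise: {masked}\nHypothesis: {example.hypothesis}\nFill in [blank]:"


def distill_counterfactuals(
    original: Example,
    span: str,
    generate: Callable[[str], List[str]],
    teacher: Callable[[str, str], Dict[str, float]],
    min_confidence: float = 0.5,
) -> List[Example]:
    """Overgenerate phrasal perturbations, then keep those the teacher accepts.

    Assumed filter (not taken from the paper): the teacher's top label must
    differ from the original label and reach `min_confidence`.
    """
    kept: List[Example] = []
    for replacement in generate(build_prompt(original, span)):
        new_premise = original.premise.replace(span, replacement, 1)
        if new_premise == original.premise:
            continue  # the generator returned the original phrase
        probs = teacher(new_premise, original.hypothesis)
        new_label = max(probs, key=probs.get)
        if new_label != original.label and probs[new_label] >= min_confidence:
            kept.append(Example(new_premise, original.hypothesis, new_label))
    return kept


# Toy stand-ins so the sketch runs without any model; both are hypothetical.
def toy_generate(prompt: str) -> List[str]:
    return ["sleeping next to", "riding", "sitting on"]


def toy_teacher(premise: str, hypothesis: str) -> Dict[str, float]:
    if "next to" in premise:
        return {"entailment": 0.05, "neutral": 0.05, "contradiction": 0.9}
    return {"entailment": 0.8, "neutral": 0.15, "contradiction": 0.05}


original = Example("A man is riding a horse.", "A person is on a horse.", "entailment")
counterfactuals = distill_counterfactuals(original, "riding", toy_generate, toy_teacher)
```

With these stubs, only the "sleeping next to" perturbation survives: "riding" leaves the premise unchanged, and "sitting on" keeps the original label. In a real setting, `generate` and `teacher` would wrap actual models, and the surviving examples would be added to the student model's training data.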
