AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning

11/29/2022
by   Jiaxin Wen, et al.
0

Recent studies have shown the impressive efficacy of counterfactually augmented data (CAD) for reducing NLU models' reliance on spurious features and improving their generalizability. However, current methods still heavily rely on human efforts or task-specific designs to generate counterfactuals, thereby impeding CAD's applicability to a broad range of NLU tasks. In this paper, we present AutoCAD, a fully automatic and task-agnostic CAD generation framework. AutoCAD first leverages a classifier to unsupervisedly identify rationales as spans to be intervened, which disentangles spurious and causal features. Then, AutoCAD performs controllable generation enhanced by unlikelihood training to produce diverse counterfactuals. Extensive evaluations on multiple out-of-domain and challenge benchmarks demonstrate that AutoCAD consistently and significantly boosts the out-of-distribution performance of powerful pre-trained models across different NLU tasks, which is comparable or even better than previous state-of-the-art human-in-the-loop or task-specific CAD methods. The code is publicly available at https://github.com/thu-coai/AutoCAD.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/30/2023

Hierarchical Neural Coding for Controllable CAD Model Generation

This paper presents a novel generative model for Computer Aided Design (...
research
12/15/2020

A Closer Look at the Robustness of Vision-and-Language Pre-trained Models

Large-scale pre-trained multimodal transformers, such as ViLBERT and UNI...
research
02/18/2023

Improving the Out-Of-Distribution Generalization Capability of Language Models: Counterfactually-Augmented Data is not Enough

Counterfactually-Augmented Data (CAD) has the potential to improve langu...
research
09/14/2021

How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?

As NLP models are increasingly deployed in socially situated settings su...
research
03/29/2022

Task-specific Inconsistency Alignment for Domain Adaptive Object Detection

Detectors trained with massive labeled data often exhibit dramatic perfo...
research
07/01/2021

An Investigation of the (In)effectiveness of Counterfactually Augmented Data

While pretrained language models achieve excellent performance on natura...
research
05/09/2022

Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection

Counterfactually Augmented Data (CAD) aims to improve out-of-domain gene...

Please sign up or login with your details

Forgot password? Click here to reset