Finetuning large pre-trained language models with a task-specific head h...
We present Semi-Structured Explanations for COPA (COPA-SSE), a new
crowd...
Improving model generalization on held-out data is one of the core objec...
Pretrained language models, such as BERT and RoBERTa, have shown large
i...