On Generalization in Coreference Resolution

09/20/2021
by   Shubham Toshniwal, et al.
5

While coreference resolution is defined independently of dataset domain, most models for performing coreference resolution do not transfer well to unseen domains. We consolidate a set of 8 coreference resolution datasets targeting different domains to evaluate the off-the-shelf performance of models. We then mix three datasets for training; even though their domain, annotation guidelines, and metadata differ, we propose a method for jointly training a single model on this heterogeneous data mixture by using data augmentation to account for annotation differences and sampling to balance the data quantities. We find that in a zero-shot setting, models trained on a single dataset transfer poorly while joint training yields improved overall performance, leading to better generalization in coreference resolution models. This work contributes a new benchmark for robust coreference resolution and multiple new state-of-the-art results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/17/2021

Moving on from OntoNotes: Coreference Resolution Model Transfer

Academic neural models for coreference resolution are typically trained ...
research
03/16/2023

Investigating Failures to Generalize for Coreference Resolution Models

Coreference resolution models are often evaluated on multiple datasets. ...
research
08/02/2022

Multilingual Coreference Resolution in Multiparty Dialogue

Existing multiparty dialogue datasets for coreference resolution are nas...
research
08/17/2020

Zero Shot Domain Generalization

Standard supervised learning setting assumes that training data and test...
research
10/15/2021

Reappraising Domain Generalization in Neural Networks

Domain generalization (DG) of machine learning algorithms is defined as ...
research
12/27/2021

MSeg: A Composite Dataset for Multi-domain Semantic Segmentation

We present MSeg, a composite dataset that unifies semantic segmentation ...
research
02/23/2023

Out-of-Domain Robustness via Targeted Augmentations

Models trained on one set of domains often suffer performance drops on u...

Please sign up or login with your details

Forgot password? Click here to reset