OntoGUM: Evaluating Contextualized SOTA Coreference Resolution on 12 More Genres

06/02/2021
by   YIlun Zhu, et al.
0

SOTA coreference resolution produces increasingly impressive scores on the OntoNotes benchmark. However lack of comparable data following the same scheme for more genres makes it difficult to evaluate generalizability to open domain data. This paper provides a dataset and comprehensive evaluation showing that the latest neural LM based end-to-end systems degrade very substantially out of domain. We make an OntoNotes-like coreference dataset called OntoGUM publicly available, converted from GUM, an English corpus covering 12 genres, using deterministic rules, which we evaluate. Thanks to the rich syntactic and discourse annotations in GUM, we are able to create the largest human-annotated coreference corpus following the OntoNotes guidelines, and the first to be evaluated for consistency with the OntoNotes scheme. Out-of-domain evaluation across 12 genres shows nearly 15-20 deep learning systems, indicating a lack of generalizability or covert overfitting in existing coreference resolution models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2021

Anatomy of OntoGUM–Adapting GUM to the OntoNotes Scheme to Evaluate Robustness of SOTA Coreference Algorithms

SOTA coreference resolution produces increasingly impressive scores on t...
research
09/21/2021

Negation-Instance Based Evaluation of End-to-End Negation Resolution

In this paper, we revisit the task of negation resolution, which include...
research
10/29/2017

JESC: Japanese-English Subtitle Corpus

In this paper we describe the Japanese-English Subtitle Corpus (JESC). J...
research
07/26/2021

Multilingual Coreference Resolution with Harmonized Annotations

In this paper, we present coreference resolution experiments with a newl...
research
06/06/2017

Marmara Turkish Coreference Corpus and Coreference Resolution Baseline

We describe the Marmara Turkish Coreference Corpus, which is an annotati...
research
05/21/2018

NeuralREG: An end-to-end approach to referring expression generation

Traditionally, Referring Expression Generation (REG) models first decide...

Please sign up or login with your details

Forgot password? Click here to reset