Negation-Instance Based Evaluation of End-to-End Negation Resolution

09/21/2021
by   Elizaveta Sineva, et al.
0

In this paper, we revisit the task of negation resolution, which includes the subtasks of cue detection (e.g. "not", "never") and scope resolution. In the context of previous shared tasks, a variety of evaluation metrics have been proposed. Subsequent works usually use different subsets of these, including variations and custom implementations, rendering meaningful comparisons between systems difficult. Examining the problem both from a linguistic perspective and from a downstream viewpoint, we here argue for a negation-instance based approach to evaluating negation resolution. Our proposed metrics correspond to expectations over per-instance scores and hence are intuitively interpretable. To render research comparable and to foster future work, we provide results for a set of current state-of-the-art systems for negation resolution on three English corpora, and make our implementation of the evaluation scripts publicly available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2021

An end-to-end Optical Character Recognition approach for ultra-low-resolution printed text images

Some historical and more recent printed documents have been scanned or s...
research
06/02/2021

OntoGUM: Evaluating Contextualized SOTA Coreference Resolution on 12 More Genres

SOTA coreference resolution produces increasingly impressive scores on t...
research
06/11/2020

CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

Despite the recent success of text detection and recognition methods, ex...
research
04/14/2020

A Human Evaluation of AMR-to-English Generation Systems

Most current state-of-the art systems for generating English text from A...
research
10/20/2021

Better than Average: Paired Evaluation of NLP Systems

Evaluation in NLP is usually done by comparing the scores of competing s...
research
11/08/2022

Review of coreference resolution in English and Persian

Coreference resolution (CR) is one of the most challenging areas of natu...
research
03/16/2023

Investigating Failures to Generalize for Coreference Resolution Models

Coreference resolution models are often evaluated on multiple datasets. ...

Please sign up or login with your details

Forgot password? Click here to reset