Measuring Annotator Agreement Generally across Complex Structured, Multi-object, and Free-text Annotation Tasks

12/15/2022
by Alexander Braylan, et al.

When annotators label data, a key metric for quality assurance is inter-annotator agreement (IAA): the extent to which annotators agree on their labels. Though many IAA measures exist for simple categorical and ordinal labeling tasks, relatively little work has considered more complex labeling tasks, such as structured, multi-object, and free-text annotations. Krippendorff's alpha, best known for use with simpler labeling tasks, does have a distance-based formulation with broader applicability, but little work has studied its efficacy and consistency across complex annotation tasks. We investigate the design and evaluation of IAA measures for complex annotation tasks, with evaluation spanning seven diverse tasks: image bounding boxes, image keypoints, text sequence tagging, ranked lists, free text translations, numeric vectors, and syntax trees. We identify the difficulty of interpretability and the complexity of choosing a distance function as key obstacles in applying Krippendorff's alpha generally across these tasks. We propose two novel, more interpretable measures, showing they yield more consistent IAA measures across tasks and annotation distance functions.
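As a rough illustration of the distance-based formulation of Krippendorff's alpha mentioned above, the sketch below computes alpha as one minus the ratio of observed to expected disagreement, where observed disagreement averages distances between annotations of the same item and expected disagreement averages distances between all annotations regardless of item. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names `krippendorff_alpha` and `iou_distance` are hypothetical, the supplied distance is used directly as the disagreement value, and each annotation is assumed to be a single bounding box `(x1, y1, x2, y2)` rather than a set of boxes.

```python
from itertools import combinations


def krippendorff_alpha(items, distance):
    """Distance-based alpha: 1 - observed / expected disagreement.

    items: list of lists; each inner list holds the annotations for one item.
    distance: function (a, b) -> non-negative float, symmetric, 0 when a == b.
    """
    # Only items with at least two annotations contribute pairable values.
    units = [u for u in items if len(u) >= 2]
    values = [v for u in units for v in u]
    n = len(values)
    if n < 2:
        raise ValueError("need at least two pairable annotations")

    # Observed disagreement: within-item pairwise distances,
    # each item weighted by 1 / (number of its annotations - 1).
    d_o = sum(
        2 * sum(distance(a, b) for a, b in combinations(u, 2)) / (len(u) - 1)
        for u in units
    ) / n

    # Expected disagreement: pairwise distances over all annotations,
    # ignoring which item they belong to.
    d_e = 2 * sum(distance(a, b) for a, b in combinations(values, 2)) / (n * (n - 1))
    if d_e == 0:
        raise ValueError("all annotations are identical; alpha is undefined")
    return 1 - d_o / d_e


def iou_distance(a, b):
    """Assumed distance for boxes (x1, y1, x2, y2): 1 - intersection over union."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return 1.0 - inter / union if union > 0 else 0.0


# Two images, each labeled with one box by two annotators.
boxes = [
    [(0, 0, 10, 10), (1, 1, 10, 10)],
    [(20, 20, 30, 30), (20, 20, 29, 31)],
]
print(krippendorff_alpha(boxes, iou_distance))
```

Swapping `iou_distance` for another function (for example, an edit distance for free text) changes the scale of the resulting alpha, which is one way to see why the choice of distance function affects how the value should be interpreted.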


