Annotation Uncertainty in the Context of Grammatical Change

05/15/2021
by   Marie-Luis Merten, et al.
0

This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by lacking annotation expertise. By examining annotation uncertainty in more detail, we identify the sources and deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice. Moreover, some practical implications of our theoretical findings are also discussed. Last but not least, this article can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2023

UnScientify: Detecting Scientific Uncertainty in Scholarly Full Text

This demo paper presents UnScientify, an interactive system designed to ...
research
05/15/2023

Using LLM-assisted Annotation for Corpus Linguistics: A Case Study of Local Grammar Analysis

Chatbots based on Large Language Models (LLMs) have shown strong capabil...
research
06/27/2017

Using text analysis to quantify the similarity and evolution of scientific disciplines

We use an information-theoretic measure of linguistic similarity to inve...
research
04/02/2020

NUBES: A Corpus of Negation and Uncertainty in Spanish Clinical Texts

This paper introduces the first version of the NUBes corpus (Negation an...
research
11/22/2020

Standardizing linguistic data: method and tools for annotating (pre-orthographic) French

With the development of big corpora of various periods, it becomes cruci...
research
05/15/2023

Skin Deep: Investigating Subjectivity in Skin Tone Annotations for Computer Vision Benchmark Datasets

To investigate the well-observed racial disparities in computer vision s...
research
04/19/2020

The Morality and Rationality of Ambiguity Aversion

In their article, "Egalitarianism under Severe Uncertainty", (Philosophy...

Please sign up or login with your details

Forgot password? Click here to reset