Overview of Annotation Creation: Processes & Tools

02/18/2016
by   Mark A. Finlayson, et al.
0

Creating linguistic annotations requires more than just a reliable annotation scheme. Annotation can be a complex endeavour potentially involving many people, stages, and tools. This chapter outlines the process of creating end-to-end linguistic annotations, identifying specific tasks that researchers often perform. Because tool support is so central to achieving high quality, reusable annotations with low cost, the focus is on identifying capabilities that are necessary or useful for annotation tools, as well as common problems these tools present that reduce their utility. Although examples of specific tools are provided in many cases, this chapter concentrates more on abstract capabilities and problems because new tools appear continuously, while old tools disappear into disuse or disrepair. The two core capabilities tools must have are support for the chosen annotation scheme and the ability to work on the language under study. Additional capabilities are organized into three categories: those that are widely provided; those that often useful but found in only a few tools; and those that have as yet little or no available tool support.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2022

BoAT v2 – A Web-Based Dependency Annotation Tool with Focus on Agglutinative Languages

The value of quality treebanks is steadily increasing due to the crucial...
research
10/13/2020

Annotationsaurus: A Searchable Directory of Annotation Tools

Manual annotation of textual documents is a necessary task when construc...
research
11/22/2020

Standardizing linguistic data: method and tools for annotating (pre-orthographic) French

With the development of big corpora of various periods, it becomes cruci...
research
11/01/2017

Text Annotation Graphs: Annotating Complex Natural Language Phenomena

This paper introduces a new web-based software tool for annotating text,...
research
05/23/2023

EASE: An Easily-Customized Annotation System Powered by Efficiency Enhancement Mechanisms

The performance of current supervised AI systems is tightly connected to...
research
12/03/2021

Creating and Managing a large annotated parallel corpora of Indian languages

This paper presents the challenges in creating and managing large parall...
research
12/17/2021

Can we Fix the Scope for Coreference? Problems and Solutions for Benchmarks beyond OntoNotes

Current work on automatic coreference resolution has focused on the Onto...

Please sign up or login with your details

Forgot password? Click here to reset