Overview of Annotation Creation: Processes & Tools

by Mark A. Finlayson, et al.

Creating linguistic annotations requires more than just a reliable annotation scheme. Annotation can be a complex endeavour, potentially involving many people, stages, and tools. This chapter outlines the process of creating end-to-end linguistic annotations, identifying specific tasks that researchers often perform. Because tool support is so central to achieving high-quality, reusable annotations at low cost, the focus is on identifying capabilities that are necessary or useful for annotation tools, as well as common problems these tools present that reduce their utility. Although examples of specific tools are provided in many cases, this chapter concentrates more on abstract capabilities and problems because new tools appear continuously, while old tools disappear into disuse or disrepair. The two core capabilities tools must have are support for the chosen annotation scheme and the ability to work on the language under study. Additional capabilities are organized into three categories: those that are widely provided; those that are often useful but found in only a few tools; and those that as yet have little or no available tool support.




BoAT v2 – A Web-Based Dependency Annotation Tool with Focus on Agglutinative Languages

The value of quality treebanks is steadily increasing due to the crucial...

Annotationsaurus: A Searchable Directory of Annotation Tools

Manual annotation of textual documents is a necessary task when construc...

Standardizing linguistic data: method and tools for annotating (pre-orthographic) French

With the development of big corpora of various periods, it becomes cruci...

Text Annotation Graphs: Annotating Complex Natural Language Phenomena

This paper introduces a new web-based software tool for annotating text,...

Creating and Managing a large annotated parallel corpora of Indian languages

This paper presents the challenges in creating and managing large parall...

Understanding How Programmers Can Use Annotations on Documentation

Modern software development requires developers to find and effectively ...

HUMAN: Hierarchical Universal Modular ANnotator

A lot of real-world phenomena are complex and cannot be captured by sing...