DeepAI AI Chat
Log In Sign Up

Marrying Universal Dependencies and Universal Morphology

by   Arya D. McCarthy, et al.

The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages - UD at the token level and UniMorph at the type level. As each corpus is built by different annotators, language-specific decisions hinder the goal of universal schemata. With compatibility of tags, each project's annotations could be used to validate the other's. Additionally, the availability of both type- and token-level resources would be a boon to tasks such as parsing and homograph disambiguation. To ease this interoperability, we present a deterministic mapping from Universal Dependencies v2 features into the UniMorph schema. We validate our approach by lookup in the UniMorph corpora and find a macro-average of 64.13 of data on either side. Finally, we present a critical evaluation of the foundations, strengths, and weaknesses of the two annotation projects.


page 1

page 2

page 3

page 4


UniMorph 2.0: Universal Morphology

The Universal Morphology UniMorph project is a collaborative effort to i...

UniMorph 4.0: Universal Morphology

The Universal Morphology (UniMorph) project is a collaborative effort pr...

Mischievous Nominal Constructions in Universal Dependencies

While the highly multilingual Universal Dependencies (UD) project provid...

K-UniMorph: Korean Universal Morphology and its Feature Schema

We present in this work a new Universal Morphology dataset for Korean. P...

Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo

In this paper, we launch a new Universal Dependencies treebank for an en...

Universal Dependencies to Logical Forms with Negation Scope

Many language technology applications would benefit from the ability to ...