Marrying Universal Dependencies and Universal Morphology

10/15/2018
by Arya D. McCarthy, et al.

The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages: UD at the token level and UniMorph at the type level. As each corpus is built by different annotators, language-specific decisions hinder the goal of universal schemata. With compatibility of tags, each project's annotations could be used to validate the other's. Additionally, the availability of both type- and token-level resources would be a boon to tasks such as parsing and homograph disambiguation. To ease this interoperability, we present a deterministic mapping from Universal Dependencies v2 features into the UniMorph schema. We validate our approach by lookup in the UniMorph corpora and find a macro-average of 64.13% recall. We also note incompatibilities due to paucity of data on either side. Finally, we present a critical evaluation of the foundations, strengths, and weaknesses of the two annotation projects.
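
To make the idea of a deterministic feature mapping and lookup-based validation concrete, here is a minimal Python sketch. It is not the authors' implementation: the mapping tables are a small illustrative subset, the function names (`convert`, `lookup_recall`, `macro_average`) are hypothetical, and the recall definition (share of tokens found in the UniMorph table whose converted tag set matches one of the listed entries) is an assumption made for this example.

```python
# Illustrative subset only; NOT the authors' actual mapping table.
UD_TO_UNIMORPH = {
    "Number=Sing": "SG",
    "Number=Plur": "PL",
    "Tense=Past": "PST",
    "Tense=Pres": "PRS",
    "Person=3": "3",
    "Case=Nom": "NOM",
    "Case=Acc": "ACC",
}
UPOS_TO_UNIMORPH = {"NOUN": "N", "VERB": "V", "ADJ": "ADJ"}


def convert(upos, feats):
    """Map a UD UPOS tag and FEATS string to a set of UniMorph tags.

    Features missing from the table are silently dropped (a simplification).
    """
    tags = set()
    if upos in UPOS_TO_UNIMORPH:
        tags.add(UPOS_TO_UNIMORPH[upos])
    if feats and feats != "_":  # "_" marks an empty FEATS column in CoNLL-U
        for feat in feats.split("|"):
            tag = UD_TO_UNIMORPH.get(feat)
            if tag is not None:
                tags.add(tag)
    return frozenset(tags)


def lookup_recall(tokens, table):
    """Assumed recall: among tokens whose (lemma, form) is in the table,
    the fraction whose converted tag set equals one of the listed tag sets."""
    hits = 0
    total = 0
    for lemma, form, upos, feats in tokens:
        gold = table.get((lemma, form))
        if gold is None:
            continue  # no type-level entry to compare against
        total += 1
        if convert(upos, feats) in gold:
            hits += 1
    return hits / total if total else 0.0


def macro_average(per_language_scores):
    """Unweighted mean over languages."""
    return sum(per_language_scores) / len(per_language_scores)


# Toy data, invented for illustration.
table = {
    ("walk", "walks"): {frozenset({"V", "SG", "3", "PRS"})},
    ("dog", "dogs"): {frozenset({"N", "PL"})},
}
tokens = [
    ("walk", "walks", "VERB", "Number=Sing|Person=3|Tense=Pres"),
    ("dog", "dogs", "NOUN", "Number=Sing"),  # deliberately mismatched
    ("xyz", "xyz", "NOUN", "_"),             # absent from the table
]
score = lookup_recall(tokens, table)
```

Because the mapping is a plain table lookup, the same UD input always yields the same UniMorph tag set, which is what makes validating one resource against the other feasible.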


