Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo

06/21/2022
by Roberto Zariquiey, et al.

In this paper, we launch a new Universal Dependencies treebank for an endangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru. We first discuss the collaborative methodology implemented, which proved effective for creating a treebank in the context of a Computational Linguistics course for undergraduates. Then, we describe the general details of the treebank and the language-specific considerations implemented for the proposed annotation. Finally, we conduct experiments on part-of-speech tagging and syntactic dependency parsing. We focus on monolingual and transfer learning settings, where we study the impact of a treebank for Shipibo-Konibo, another Panoan language.

research
10/16/2017

BKTreebank: Building a Vietnamese Dependency Treebank

Dependency treebank is an important resource in any language. In this pa...
research
05/13/2016

Universal Dependencies for Learner English

We introduce the Treebank of Learner English (TLE), the first publicly a...
research
04/29/2020

UDapter: Language Adaptation for Truly Universal Dependency Parsing

Recent advances in the field of multilingual dependency parsing have bro...
research
09/07/2019

Dependency Parsing for Spoken Dialog Systems

Dependency parsing of conversational input can play an important role in...
research
09/20/2022

Yet Another Format of Universal Dependencies for Korean

In this study, we propose a morpheme-based scheme for Korean dependency ...
research
10/15/2018

Marrying Universal Dependencies and Universal Morphology

The Universal Dependencies (UD) and Universal Morphology (UniMorph) proj...
research
04/23/2018

Parsing Tweets into Universal Dependencies

We study the problem of analyzing tweets with Universal Dependencies. We...
