K-UniMorph: Korean Universal Morphology and its Feature Schema

05/10/2023
by   Eunkyul Leah Jo, et al.
0

We present in this work a new Universal Morphology dataset for Korean. Previously, the Korean language has been underrepresented in the field of morphological paradigms amongst hundreds of diverse world languages. Hence, we propose this Universal Morphological paradigms for the Korean language that preserve its distinct characteristics. For our K-UniMorph dataset, we outline each grammatical criterion in detail for the verbal endings, clarify how to extract inflected forms, and demonstrate how we generate the morphological schemata. This dataset adopts morphological feature schema from Sylak-Glassman et al. (2015) and Sylak-Glassman (2016) for the Korean language as we extract inflected verb forms from the Sejong morphologically analyzed corpus that is one of the largest annotated corpora for Korean. During the data creation, our methodology also includes investigating the correctness of the conversion from the Sejong corpus. Furthermore, we carry out the inflection task using three different Korean word forms: letters, syllables and morphemes. Finally, we discuss and describe future perspectives on Korean morphological paradigms and the dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2022

UniMorph 4.0: Universal Morphology

The Universal Morphology (UniMorph) project is a collaborative effort pr...
research
10/25/2018

UniMorph 2.0: Universal Morphology

The Universal Morphology UniMorph project is a collaborative effort to i...
research
07/08/2018

On the Complexity and Typology of Inflectional Morphological Systems

We quantify the linguistic complexity of different languages' morphologi...
research
03/16/2022

Morphological Reinflection with Multiple Arguments: An Extended Annotation schema and a Georgian Case Study

In recent years, a flurry of morphological datasets had emerged, most no...
research
06/27/2019

Morphological Irregularity Correlates with Frequency

We present a study of morphological irregularity. Following recent work,...
research
08/13/2018

Comparing morphological complexity of Spanish, Otomi and Nahuatl

We use two small parallel corpora for comparing the morphological comple...
research
10/15/2018

Marrying Universal Dependencies and Universal Morphology

The Universal Dependencies (UD) and Universal Morphology (UniMorph) proj...

Please sign up or login with your details

Forgot password? Click here to reset