Unsupervised Morphological Paradigm Completion

05/03/2020
by   Huiming Jin, et al.
0

We propose the task of unsupervised morphological paradigm completion. Given only raw text and a lemma list, the task consists of generating the morphological paradigms, i.e., all inflected forms, of the lemmas. From a natural language processing (NLP) perspective, this is a challenging unsupervised task, and high-performing systems have the potential to improve tools for low-resource languages or to assist linguistic annotators. From a cognitive science perspective, this can shed light on how children acquire morphological knowledge. We further introduce a system for the task, which generates morphological paradigms via the following steps: (i) EDIT TREE retrieval, (ii) additional lemma retrieval, (iii) paradigm size discovery, and (iv) inflection generation. We perform an evaluation on 14 typologically diverse languages. Our system outperforms trivial baselines with ease and, for some languages, even obtains a higher accuracy than minimally supervised systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2020

The IMS-CUBoulder System for the SIGMORPHON 2020 Shared Task on Unsupervised Morphological Paradigm Completion

In this paper, we present the systems of the University of Stuttgart IMS...
research
07/08/2018

On the Complexity and Typology of Inflectional Morphological Systems

We quantify the linguistic complexity of different languages' morphologi...
research
03/16/2022

Morphological Processing of Low-Resource Languages: Where We Are and What's Next

Automatic morphological processing can aid downstream natural language p...
research
09/24/2018

Neural Transductive Learning and Beyond: Morphological Generation in the Minimal-Resource Setting

Neural state-of-the-art sequence-to-sequence (seq2seq) models often do n...
research
05/25/2023

Morphological Inflection: A Reality Check

Morphological inflection is a popular task in sub-word NLP with both pra...
research
05/04/2020

The Paradigm Discovery Problem

This work treats the paradigm discovery problem (PDP), the task of learn...
research
04/28/2020

KoParadigm: A Korean Conjugation Paradigm Generator

Korean is a morphologically rich language. Korean verbs change their for...

Please sign up or login with your details

Forgot password? Click here to reset