Morphology Without Borders: Clause-Level Morphological Annotation

02/25/2022
by   Omer Goldman, et al.
0

Morphological tasks use large multi-lingual datasets that organize words into inflection tables, which then serve as training and evaluation data for various tasks. However, a closer inspection of these data reveals profound cross-linguistic inconsistencies, that arise from the lack of a clear linguistic and operational definition of what is a word, and that severely impair the universality of the derived tasks. To overcome this deficiency, we propose to view morphology as a clause-level phenomenon, rather than word-level. It is anchored in a fixed yet inclusive set of features homogeneous across languages, that encapsulates all functions realized in a saturated clause. We deliver MightyMorph, a novel dataset for clause-level morphology covering 4 typologically-different languages: English, German, Turkish and Hebrew. We use this dataset to derive 3 clause-level morphological tasks: inflection, reinflection and analysis. Our experiments show that the clause-level tasks are substantially harder than the respective word-level tasks, while having comparable complexity across languages. Furthermore, redefining morphology to the clause-level provides a neat interface with contextualized language models (LMs) and can be used to probe LMs capacity to encode complex morphology. Taken together, this work opens up new horizons in the study of computational morphology, leaving ample space for studying neural morphological modeling cross-linguistically.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/11/2021

Evaluation of Morphological Embeddings for English and Russian Languages

This paper evaluates morphology-based embeddings for English and Russian...
research
04/05/2020

A Resource for Studying Chatino Verbal Morphology

We present the first resource focusing on the verbal inflectional morpho...
research
07/04/2019

Morphological Word Embeddings

Linguistic similarity is multi-faceted. For instance, two words may be s...
research
04/17/2021

Minimal Supervision for Morphological Inflection

Neural models for the various flavours of morphological inflection tasks...
research
11/15/2020

Morphologically Aware Word-Level Translation

We propose a novel morphologically aware probability model for bilingual...
research
05/30/2017

Morphological Error Detection in 3D Segmentations

Deep learning algorithms for connectomics rely upon localized classifica...
research
08/15/2019

What's Wrong with Hebrew NLP? And How to Make it Right

For languages with simple morphology, such as English, automatic annotat...

Please sign up or login with your details

Forgot password? Click here to reset