Morphological Irregularity Correlates with Frequency

06/27/2019
by Shijie Wu, et al.

We present a study of morphological irregularity. Following recent work, we define an information-theoretic measure of irregularity based on the predictability of forms in a language. Using a neural transduction model, we estimate this quantity for the forms in 28 languages. We first present several validatory and exploratory analyses of irregularity. We then show that our analyses provide evidence for a correlation between irregularity and frequency: higher-frequency items are more likely to be irregular, and irregular items are more likely to be highly frequent. To our knowledge, this result is the first of its breadth and confirms longstanding proposals from the linguistics literature. The correlation is more robust when aggregated at the level of whole paradigms, which supports models of linguistic structure in which inflected forms are unified by abstract underlying stems or lexemes. Code is available at https://github.com/shijie-wu/neural-transducer.

research
07/08/2018

On the Complexity and Typology of Inflectional Morphological Systems

We quantify the linguistic complexity of different languages' morphologi...
research
04/04/2019

A Simple Joint Model for Improved Contextual Neural Lemmatization

English verbs have multiple forms. For instance, talk may also appear as...
research
05/10/2023

K-UniMorph: Korean Universal Morphology and its Feature Schema

We present in this work a new Universal Morphology dataset for Korean. P...
research
08/09/2023

Information-Theoretic Characterization of Vowel Harmony: A Cross-Linguistic Study on Word Lists

We present a cross-linguistic study that aims to quantify vowel harmony ...
research
02/27/2020

Understanding and Enhancing Mixed Sample Data Augmentation

Mixed Sample Data Augmentation (MSDA) has received increasing attention ...
research
01/22/2021

Using Finite-State Machines to Automatically Scan Classical Greek Hexameter

This paper presents a fully automatic approach to the scansion of Classi...
