Dialectal Layers in West Iranian: a Hierarchical Dirichlet Process Approach to Linguistic Relationships

by Chundra Aroor Cathcart, et al.

This paper addresses a series of complex and unresolved issues in the historical phonology of West Iranian languages. The West Iranian languages (Persian, Kurdish, Balochi, and other languages) display a high degree of non-Lautgesetzlich behavior. Most of this irregularity is undoubtedly due to language contact; we argue, however, that an oversimplified view of the processes at work has prevailed in the literature on West Iranian dialectology, with specialists assuming that deviations from an expected outcome in a given non-Persian language are due to lexical borrowing from some chronological stage of Persian. It is demonstrated that this qualitative approach yields at times problematic conclusions stemming from the lack of explicit probabilistic inferences regarding the distribution of the data: Persian may not be the sole donor language; additionally, borrowing at the lexical level is not always the mechanism that introduces irregularity. In many cases, the possibility that West Iranian languages show different reflexes in different conditioning environments remains under-explored. We employ a novel Bayesian approach designed to overcome these problems and tease apart the different determinants of irregularity in patterns of West Iranian sound change. Our methodology allows us to provisionally resolve a number of outstanding questions in the literature on West Iranian dialectology concerning the dialectal affiliation of certain sound changes. We outline future directions for work of this sort.




Creating Lexical Resources for Endangered Languages

This paper examines approaches to generate lexical resources for endange...

Letters From the Past: Modeling Historical Sound Change Through Diachronic Character Embeddings

While a great deal of work has been done on NLP approaches to lexical se...

In search of isoglosses: continuous and discrete language embeddings in Slavic historical phonology

This paper investigates the ability of neural network architectures to e...

Probing Pretrained Language Models for Lexical Semantics

The success of large pretrained language models (LMs) such as BERT and R...

The language (and series) of Hammersley-type processes

We study languages and formal power series associated to (variants of) H...

Probabilistic Typology: Deep Generative Models of Vowel Inventories

Linguistic typology studies the range of structures present in human lan...

Quantitative methods for Phylogenetic Inference in Historical Linguistics: An experimental case study of South Central Dravidian

In this paper we examine the usefulness of two classes of algorithms Dis...
