In search of isoglosses: continuous and discrete language embeddings in Slavic historical phonology

05/27/2020
by   Chundra A. Cathcart, et al.
0

This paper investigates the ability of neural network architectures to effectively learn diachronic phonological generalizations in a multilingual setting. We employ models using three different types of language embedding (dense, sigmoid, and straight-through). We find that the Straight-Through model outperforms the other two in terms of accuracy, but the Sigmoid model's language embeddings show the strongest agreement with the traditional subgrouping of the Slavic languages. We find that the Straight-Through model has learned coherent, semi-interpretable information about sound change, and outline directions for future research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/22/2016

Continuous multilinguality with language vectors

Most existing models for multilingual natural language processing (NLP) ...
research
08/07/2019

Ab Antiquo: Proto-language Reconstruction with RNNs

Historical linguists have identified regularities in the process of hist...
research
05/17/2022

Letters From the Past: Modeling Historical Sound Change Through Diachronic Character Embeddings

While a great deal of work has been done on NLP approaches to lexical se...
research
01/13/2020

Dialectal Layers in West Iranian: a Hierarchical Dirichlet Process Approach to Linguistic Relationships

This paper addresses a series of complex and unresolved issues in the hi...
research
10/09/2018

A Fast, Compact, Accurate Model for Language Identification of Codemixed Text

We address fine-grained multilingual language identification: providing ...
research
04/07/2022

Detecting Vocal Fatigue with Neural Embeddings

Vocal fatigue refers to the feeling of tiredness and weakness of voice d...
research
09/20/2018

Machine Learning for semi linear PDEs

Recent machine learning algorithms dedicated to solving semi-linear PDEs...

Please sign up or login with your details

Forgot password? Click here to reset