In search of isoglosses: continuous and discrete language embeddings in Slavic historical phonology

05/27/2020
by Chundra A. Cathcart, et al.

This paper investigates the ability of neural network architectures to learn diachronic phonological generalizations in a multilingual setting. We employ models using three types of language embedding (dense, sigmoid, and straight-through). We find that the Straight-Through model outperforms the other two in terms of accuracy, but the Sigmoid model's language embeddings show the strongest agreement with the traditional subgrouping of the Slavic languages. We also find that the Straight-Through model has learned coherent, semi-interpretable information about sound change, and we outline directions for future research.
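
The abstract does not define the three embedding types, so the following is only a minimal sketch of how they are commonly understood, not the authors' implementation. It assumes that "dense" means an unconstrained real-valued vector, "sigmoid" means the same vector squashed into (0, 1), and "straight-through" means a hard 0/1 vector in the forward pass with a surrogate gradient in the backward pass. The function names and the example logits are hypothetical.

```python
import math

def dense_embedding(logits):
    """Dense (assumed): use the real-valued vector unchanged."""
    return list(logits)

def sigmoid_embedding(logits):
    """Sigmoid (assumed): squash each dimension into the open interval (0, 1)."""
    return [1.0 / (1.0 + math.exp(-x)) for x in logits]

def straight_through_embedding(logits):
    """Straight-through (assumed): discrete 0/1 values in the forward pass.

    Returns (hard_values, surrogate_gradient). Thresholding has zero gradient
    almost everywhere, so a straight-through estimator substitutes another
    gradient in the backward pass. Here we use the sigmoid's derivative,
    which is one common variant; the identity is another.
    """
    probs = sigmoid_embedding(logits)
    hard = [1.0 if p > 0.5 else 0.0 for p in probs]
    surrogate_grad = [p * (1.0 - p) for p in probs]
    return hard, surrogate_grad

# Hypothetical 3-dimensional language embedding parameters.
logits = [-2.0, 0.5, 3.0]
dense = dense_embedding(logits)
soft = sigmoid_embedding(logits)
hard, grad = straight_through_embedding(logits)
```

Under these assumptions, the sigmoid embedding is a continuous relaxation of a binary feature vector, while the straight-through embedding commits to discrete feature values, which is one way to read the "continuous and discrete" contrast in the title.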
