Constructing a Family Tree of Ten Indo-European Languages with Delexicalized Cross-linguistic Transfer Patterns

07/17/2020
by Yuanyuan Zhao, et al.

It is reasonable to hypothesize that the divergence patterns formulated by historical linguists and typologists reflect constraints on human languages and are thus, in some respect, consistent with patterns observed in Second Language Acquisition (SLA). In this paper, we validate this hypothesis on ten Indo-European languages. We formalize delexicalized transfer as interpretable tree-to-string and tree-to-tree patterns, which can be automatically induced from web data by applying neural syntactic parsing and grammar induction technologies. This allows us to quantitatively probe cross-linguistic transfer and extend inquiries into SLA. We extend existing work that utilizes mixed features, and our results support the agreement between delexicalized cross-linguistic transfer and the phylogenetic structure resulting from the historical-comparative paradigm.

research
12/05/2017

Phylogenetics of Indo-European Language families via an Algebro-Geometric Analysis of their Syntactic Structures

Using Phylogenetic Algebraic Geometry, we analyze computationally the ph...
research
12/21/2022

Cross-Linguistic Syntactic Difference in Multilingual BERT: How Good is It and How Does It Affect Transfer?

Multilingual BERT (mBERT) has demonstrated considerable cross-lingual sy...
research
01/03/2014

Quantitative methods for Phylogenetic Inference in Historical Linguistics: An experimental case study of South Central Dravidian

In this paper we examine the usefulness of two classes of algorithms Dis...
research
01/29/2018

Geospatial distributions reflect rates of evolution of features of language

Different structural features of human language change at different rate...
research
02/03/2020

Phylogenetic signal in phonotactics

Phylogenetic methods have broad potential in linguistics beyond tree inf...
research
06/16/2020

Ranking Transfer Languages with Pragmatically-Motivated Features for Multilingual Sentiment Analysis

Cross-lingual transfer learning studies how datasets, annotations, and m...
research
05/09/2018

Three tree priors and five datasets: A study of the effect of tree priors in Indo-European phylogenetics

The age of the root of the Indo-European language family has received mu...
