The optimality of syntactic dependency distances

07/30/2020
by   Ramon Ferrer-i-Cancho, et al.
0

It is often stated that human languages, as other biological systems, are shaped by cost-cutting pressures but, to what extent? Attempts to quantify the degree of optimality of languages by means of an optimality score have been scarce and focused mostly on English. Here we recast the problem of the optimality of the word order of a sentence as an optimization problem on a spatial network where the vertices are words, arcs indicate syntactic dependencies and the space is defined by the linear order of the words in the sentence. We introduce a new score to quantify the cognitive pressure to reduce the distance between linked words in a sentence. The analysis of sentences from 93 languages representing 19 linguistic families reveals that half of languages are optimized to a 70 significantly reduced in a few languages and confirms two theoretical predictions, i.e. that longer sentences are more optimized and that distances are more likely to be longer than expected by chance in short sentences. We present a new hierarchical ranking of languages by their degree of optimization. The statistical advantages of the new score call for a reevaluation of the evolution of dependency distance over time in languages as well as the relationship between dependency distance and linguistic competence. Finally, the principles behind the design of the score can be extended to develop more powerful normalizations of topological distances or physical distances in more dimensions.

READ FULL TEXT

page 1

page 7

page 29

page 30

page 31

research
11/26/2022

The distribution of syntactic dependency distances

The syntactic structure of a sentence can be represented as a graph wher...
research
07/07/2021

Linear-time calculation of the expected sum of edge lengths in random projective linearizations of trees

The syntactic structure of a sentence is often represented using syntact...
research
03/24/2017

Are crossing dependencies really scarce?

The syntactic structure of a sentence can be modelled as a tree, where v...
research
08/19/2019

Memory limitations are hidden in grammar

The ability to produce and understand an unlimited number of different s...
research
06/13/2019

Anti dependency distance minimization in short sequences. A graph theoretic approach

Dependency distance minimization (DDm) is a word order principle favouri...
research
08/22/2022

The optimality of word lengths. Theoretical foundations and an empirical study

One of the most robust patterns found in human languages is Zipf's law o...

Please sign up or login with your details

Forgot password? Click here to reset