Universal Topological Regularities of Syntactic Structures: Decoupling Efficiency from Optimization

Human syntactic structures are usually represented as graphs. Much research has focused on the mapping between such graphs and linguistic sequences, but less attention has been paid to the shapes of the graphs themselves: their topologies. This study investigates how the topologies of syntactic graphs reveal traces of the processes that led to their emergence. I report a new universal regularity in syntactic structures: Their topology is communicatively efficient above chance. The pattern holds, without exception, for all 124 languages studied, across linguistic families and modalities (spoken, written, and signed). This pattern can arise from a process optimizing for communicative efficiency or, alternatively, by construction, as a by-effect of a sublinear preferential attachment process reflecting language production mechanisms known from psycholinguistics. This dual explanation shows how communicative efficiency, per se, does not require optimization. Among the two options, efficiency without optimization offers the better explanation for the new pattern.

READ FULL TEXT
research
03/12/2019

Topological Analysis of Syntactic Structures

We use the persistent homology method of topological data analysis and d...
research
07/17/2019

Leveraging Linguistic Characteristics for Bipolar Disorder Recognition with Gender Differences

Most previous studies on automatic recognition model for bipolar disorde...
research
12/15/2021

Oracle Linguistic Graphs Complement a Pretrained Transformer Language Model: A Cross-formalism Comparison

We examine the extent to which, in principle, linguistic graph represent...
research
12/01/2021

Remixing Functionally Graded Structures: Data-Driven Topology Optimization with Multiclass Shape Blending

To create heterogeneous, multiscale structures with unprecedented functi...
research
06/26/2023

Prefix-free graphs and suffix array construction in sublinear space

A recent paradigm shift in bioinformatics from a single reference genome...
research
07/08/2021

COMBO: a new module for EUD parsing

We introduce the COMBO-based approach for EUD parsing and its implementa...
research
02/23/2021

Paraphrases do not explain word analogies

Many types of distributional word embeddings (weakly) encode linguistic ...

Please sign up or login with your details

Forgot password? Click here to reset