Meaning to Form: Measuring Systematicity as Information

by Tiago Pimentel, et al.

A longstanding debate in semiotics centers on the relationship between linguistic signs and their corresponding semantics: is there an arbitrary relationship between a word form and its meaning, or does some systematic phenomenon pervade? For instance, does the character bigram gl have any systematic relationship to the meaning of words like glisten, gleam and glow? In this work, we offer a holistic quantification of the systematicity of the sign using mutual information and recurrent neural networks. We employ these in a data-driven and massively multilingual approach to the question, examining 106 languages. We find a statistically significant reduction in entropy when modeling a word form conditioned on its semantic representation. Encouragingly, we also recover well-attested English examples of systematic affixes. We conclude with a meta-point: our approximate effect size (measured in bits) is quite small. Despite some amount of systematicity between form and meaning, an arbitrary relationship and its resulting benefits dominate human language.
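The quantity described in the abstract, a reduction in entropy when a word form is modeled conditioned on its meaning, can be illustrated with a minimal sketch. It assumes that two character-level models have already been trained and that each yields a natural-log probability for every held-out word form: one model sees only the form, the other is also given a semantic representation. The function names and the toy numbers below are hypothetical and are not taken from the paper. Because model cross-entropy only upper-bounds the true entropy, the difference is an approximation of the mutual information, not an exact value.

```python
import math

def bits_per_word(log_probs_nats):
    """Average negative log-likelihood per word, converted from nats to bits."""
    if not log_probs_nats:
        raise ValueError("need at least one word")
    return -sum(log_probs_nats) / (len(log_probs_nats) * math.log(2))

def systematicity_bits(logp_form, logp_form_given_meaning):
    """Approximate I(form; meaning) as H(form) - H(form | meaning), in bits.

    Both arguments are lists of natural-log probabilities for the same
    held-out words: the first from a form-only model, the second from a
    model that is also conditioned on the word's semantic representation.
    """
    if len(logp_form) != len(logp_form_given_meaning):
        raise ValueError("both models must score the same words")
    return bits_per_word(logp_form) - bits_per_word(logp_form_given_meaning)

# Toy, made-up scores for two words (not results from the paper).
unconditional = [math.log(0.25), math.log(0.25)]  # 2 bits per word
conditioned = [math.log(0.5), math.log(0.5)]      # 1 bit per word
reduction = systematicity_bits(unconditional, conditioned)
```

A positive value means that knowing the meaning makes the form more predictable; a value near zero is what a largely arbitrary form-meaning relationship would produce.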



Joint Semantic Synthesis and Morphological Analysis of the Derived Word

Much like sentences are composed of words, words themselves are composed...

Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation

We introduce a model for constructing vector representations of words by...

What Meaning-Form Correlation Has to Compose With

Compositionality is a widely discussed property of natural languages, al...

Predicting Declension Class from Form and Meaning

The noun lexica of many natural languages are divided into several decle...

Enriching Word Embeddings with Temporal and Spatial Information

The meaning of a word is closely linked to sociocultural factors that ca...

Primal Meaning Recommendation for Chinese Words and Phrases via Descriptions in On-line Encyclopedia

Polysemy is a very common phenomenon in modern languages. Most of previo...

Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolution

It is often posited that more predictable parts of a speaker's meaning t...