Meaning to Form: Measuring Systematicity as Information

06/13/2019
by Tiago Pimentel, et al.

A longstanding debate in semiotics centers on the relationship between linguistic signs and their meanings: is the relationship between a word form and its meaning arbitrary, or is it pervaded by some systematic phenomenon? For instance, does the character bigram gl have any systematic relationship to the meaning of words like glisten, gleam, and glow? In this work, we offer a holistic quantification of the systematicity of the sign using mutual information and recurrent neural networks. We employ these in a data-driven and massively multilingual approach to the question, examining 106 languages. We find a statistically significant reduction in entropy when modeling a word form conditioned on its semantic representation. Encouragingly, we also recover well-attested English examples of systematic affixes. We conclude with a meta-point: our approximate effect size (measured in bits) is quite small. Despite some amount of systematicity between form and meaning, an arbitrary relationship and its resulting benefits dominate human language.
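To make the quantity in the abstract concrete, the "reduction in entropy when modeling a word form conditioned on its semantic representation" can be read as a mutual information, I(W; V) = H(W) - H(W | V), measured in bits. The sketch below is only a toy illustration of that identity and is not the paper's method. The paper estimates the entropies with recurrent neural networks over full word forms, whereas this example assumes discrete form features and discrete meaning labels and uses simple count-based (plug-in) estimates. The function names and the toy data are hypothetical.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy (bits) of the empirical distribution given by a Counter."""
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values() if c > 0)


def mutual_information(pairs):
    """Plug-in estimate of I(W; V) = H(W) - H(W | V) from (form, meaning) pairs."""
    pairs = list(pairs)
    n = len(pairs)
    # Unconditional entropy of the form feature.
    h_w = entropy(Counter(w for w, _ in pairs))
    # Conditional entropy: H(W | V) = sum over v of p(v) * H(W | V = v).
    by_meaning = {}
    for w, v in pairs:
        by_meaning.setdefault(v, Counter())[w] += 1
    h_w_given_v = sum((sum(c.values()) / n) * entropy(c) for c in by_meaning.values())
    return h_w - h_w_given_v


# Hypothetical toy data: a word-initial bigram paired with a coarse meaning label.
toy = [
    ("gl", "light"), ("gl", "light"), ("gl", "light"), ("st", "light"),
    ("st", "other"), ("st", "other"), ("tr", "other"), ("gl", "other"),
]

if __name__ == "__main__":
    print(f"I(W; V) on toy data: {mutual_information(toy):.3f} bits")
```

If form and meaning were independent, the estimate would be zero. If meaning fully determined form, it would equal H(W). Plug-in estimates like this one are biased upward on small samples, which is one reason a held-out, model-based estimate is preferable on real lexicons.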
