Modeling natural language emergence with integral transform theory and reinforcement learning

11/30/2018
by   Bohdan Khomtchouk, et al.
0

Zipf's law predicts a power-law relationship between word rank and frequency in language communication systems and has been widely reported in a variety of natural language processing applications. However, the emergence of natural language is often modeled as a function of bias between speaker and listener interests, which lacks a direct way of relating information-theoretic bias to Zipfian rank. A function of bias also serves as an unintuitive interpretation of the communicative effort exchanged between a speaker and a listener. We counter these shortcomings by proposing a novel integral transform and kernel for mapping communicative bias functions to corresponding word frequency-rank representations at any arbitrary phase transition point, resulting in a direct way to link communicative effort (modeled by speaker/listener bias) to specific vocabulary used (represented by word rank). We demonstrate the practical utility of our integral transform by showing how a change from bias to rank results in greater accuracy and performance at an image classification task for assigning word labels to images randomly subsampled from CIFAR10. We model this task as a reinforcement learning game between a speaker and listener and compare the relative impact of bias and Zipfian word rank on communicative performance (and accuracy) between the two agents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/10/2016

Zipf's law emerges asymptotically during phase transitions in communicative systems

Zipf's law predicts a power-law relationship between word rank and frequ...
research
05/24/2021

The advent and fall of a vocabulary learning bias from communicative efficiency

It is well-known that, when sufficiently young children encounter a new ...
research
10/02/2019

Natural Language State Representation for Reinforcement Learning

Recent advances in Reinforcement Learning have highlighted the difficult...
research
01/24/2022

Bias in Automated Speaker Recognition

Automated speaker recognition uses data processing to identify speakers ...
research
09/09/2021

Debiasing Methods in Natural Language Understanding Make Bias More Accessible

Model robustness to bias is often determined by the generalization on ca...
research
05/16/2017

Agent-based model for the origins of scaling in human language

Background/Introduction: The Zipf's law establishes that if the words of...
research
08/07/2015

Automata networks model for alignment and least effort on vocabulary formation

Can artificial communities of agents develop language with scaling relat...

Please sign up or login with your details

Forgot password? Click here to reset