Efficient human-like semantic representations via the Information Bottleneck principle

08/09/2018
by Noga Zaslavsky, et al.

Maintaining efficient semantic representations of the environment is a major challenge both for humans and for machines. While human languages represent useful solutions to this problem, it is not yet clear what computational principle could give rise to similar solutions in machines. In this work we propose an answer to this open question. We suggest that languages compress percepts into words by optimizing the Information Bottleneck (IB) tradeoff between the complexity and accuracy of their lexicons. We present empirical evidence that this principle may give rise to human-like semantic representations, by exploring how human languages categorize colors. We show that color naming systems across languages are near-optimal in the IB sense, and that these natural systems are similar to artificial IB color naming systems with a single tradeoff parameter controlling the cross-language variability. In addition, the IB systems evolve through a sequence of structural phase transitions, demonstrating a possible adaptation process. This work thus identifies a computational principle that characterizes human semantic systems, and that could usefully inform semantic representations in machines.
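To make the complexity/accuracy tradeoff concrete, here is a minimal sketch in the standard IB form, where a naming system is an encoder q(w|m) from meanings m to words w and the quantity to minimize is I(M;W) - beta * I(W;U). The abstract gives no formula, so the notation, the function names (`mutual_information`, `ib_terms`), and the toy setup are all assumptions: four equally likely meanings, each of which pins down one target u exactly. None of this is the authors' color-naming model or data.

```python
import math

def mutual_information(joint):
    """Mutual information in bits of a joint distribution given as a list of rows."""
    row_sums = [sum(row) for row in joint]
    col_sums = [sum(col) for col in zip(*joint)]
    total = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            if p > 0:  # zero-probability cells contribute nothing
                total += p * math.log2(p / (row_sums[i] * col_sums[j]))
    return total

def ib_terms(p_m, q_w_given_m, p_u_given_m, beta):
    """Return (complexity, accuracy, objective) for an encoder q(w|m).

    complexity = I(M;W), accuracy = I(W;U), objective = complexity - beta * accuracy.
    """
    n_m, n_w, n_u = len(p_m), len(q_w_given_m[0]), len(p_u_given_m[0])
    # p(m, w) = p(m) q(w|m)
    joint_mw = [[p_m[m] * q_w_given_m[m][w] for w in range(n_w)]
                for m in range(n_m)]
    # p(w, u) = sum_m p(m) q(w|m) p(u|m)
    joint_wu = [[sum(joint_mw[m][w] * p_u_given_m[m][u] for m in range(n_m))
                 for u in range(n_u)]
                for w in range(n_w)]
    complexity = mutual_information(joint_mw)
    accuracy = mutual_information(joint_wu)
    return complexity, accuracy, complexity - beta * accuracy

# Toy setup (hypothetical): 4 equally likely meanings, each identifying one target.
p_m = [0.25] * 4
p_u_given_m = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]

# Three candidate lexicons: one word, two words, four words.
one_word = [[1.0] for _ in range(4)]
two_words = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
four_words = p_u_given_m  # each meaning gets its own word

for name, q in [("one", one_word), ("two", two_words), ("four", four_words)]:
    print(name, ib_terms(p_m, q, p_u_given_m, beta=1.5))
```

In this toy case each extra bit of complexity buys exactly one bit of accuracy, so the objective favors the one-word lexicon when beta is below 1 and the four-word lexicon when beta is above 1. That is the sense in which a single tradeoff parameter can move a system between coarse and fine-grained vocabularies.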
