Coordination Among Neural Modules Through a Shared Global Workspace

03/01/2021
by   Anirudh Goyal, et al.
14

Deep learning has seen a movement away from representing examples with a monolithic hidden state towards a richly structured state. For example, Transformers segment by position, and object-centric architectures decompose images into entities. In all these architectures, interactions between different elements are modeled via pairwise interactions: Transformers make use of self-attention to incorporate information from other positions; object-centric architectures make use of graph neural networks to model interactions among entities. However, pairwise interactions may not achieve global coordination or a coherent, integrated representation that can be used for downstream tasks. In cognitive science, a global workspace architecture has been proposed in which functionally specialized components share information through a common, bandwidth-limited communication channel. We explore the use of such a communication channel in the context of deep learning for modeling the structure of complex environments. The proposed method includes a shared workspace through which communication among different specialist modules takes place but due to limits on the communication bandwidth, specialist modules must compete for access. We show that capacity limitations have a rational basis in that (1) they encourage specialization and compositionality and (2) they facilitate the synchronization of otherwise independent specialists.

READ FULL TEXT

page 4

page 6

page 14

page 18

page 23

research
07/01/2022

Rethinking Query-Key Pairwise Interactions in Vision Transformers

Vision Transformers have achieved state-of-the-art performance in many v...
research
06/02/2023

Centered Self-Attention Layers

The self-attention mechanism in transformers and the message-passing mec...
research
07/06/2021

Discrete-Valued Neural Communication

Deep learning has advanced from fully connected architectures to structu...
research
08/09/2021

TransForensics: Image Forgery Localization with Dense Self-Attention

Nowadays advanced image editing tools and technical skills produce tampe...
research
03/02/2021

Neural Production Systems

Visual environments are structured, consisting of distinct objects or en...
research
02/27/2021

Transformers with Competitive Ensembles of Independent Mechanisms

An important development in deep learning from the earliest MLPs has bee...

Please sign up or login with your details

Forgot password? Click here to reset