Relating transformers to models and neural representations of the hippocampal formation

12/07/2021
by   James C. R. Whittington, et al.
0

Many deep neural network architectures loosely based on brain networks have recently been shown to replicate neural firing patterns observed in the brain. One of the most exciting and promising novel architectures, the Transformer neural network, was developed without the brain in mind. In this work, we show that transformers, when equipped with recurrent position encodings, replicate the precisely tuned spatial representations of the hippocampal formation; most notably place and grid cells. Furthermore, we show that this result is no surprise since it is closely related to current hippocampal models from neuroscience. We additionally show the transformer version offers dramatic performance gains over the neuroscience version. This work continues to bind computations of artificial and brain networks, offers a novel understanding of the hippocampal-cortical interaction, and suggests how wider cortical areas may perform complex tasks beyond current neuroscience models such as language comprehension.

READ FULL TEXT

page 3

page 8

page 19

page 20

research
07/04/2015

Modeling the Mind: A brief review

The brain is a powerful tool used to achieve amazing feats. There have b...
research
05/23/2018

Generalisation of structural knowledge in the Hippocampal-Entorhinal system

A central problem to understanding intelligence is the concept of genera...
research
07/07/2020

The curious case of developmental BERTology: On sparsity, transfer learning, generalization and the brain

In this essay, we explore a point of intersection between deep learning ...
research
12/13/2018

Ablation of a Robot's Brain: Neural Networks Under a Knife

It is still not fully understood exactly how neural networks are able to...
research
07/19/2022

Formal Algorithms for Transformers

This document aims to be a self-contained, mathematically precise overvi...
research
01/01/2023

Causal Deep Learning: Causal Capsules and Tensor Transformers

We derive a set of causal deep neural networks whose architectures are a...
research
04/03/2021

Explanatory models in neuroscience: Part 1 – taking mechanistic abstraction seriously

Despite the recent success of neural network models in mimicking animal ...

Please sign up or login with your details

Forgot password? Click here to reset