Grid Cell Path Integration For Movement-Based Visual Object Recognition

02/17/2021
by   Niels Leadholm, et al.
26

Grid cells enable the brain to model the physical space of the world and navigate effectively via path integration, updating self-position using information from self-movement. Recent proposals suggest that the brain might use similar mechanisms to understand the structure of objects in diverse sensory modalities, including vision. In machine vision, object recognition given a sequence of sensory samples of an image, such as saccades, is a challenging problem when the sequence does not follow a consistent, fixed pattern - yet this is something humans do naturally and effortlessly. We explore how grid cell-based path integration in a cortical network can support reliable recognition of objects given an arbitrary sequence of inputs. Our network (GridCellNet) uses grid cell computations to integrate visual information and make predictions based on movements. We use local Hebbian plasticity rules to learn rapidly from a handful of examples (few-shot learning), and consider the task of recognizing MNIST digits given only a sequence of image feature patches. We compare GridCellNet to k-Nearest Neighbour (k-NN) classifiers as well as recurrent neural networks (RNNs), both of which lack explicit mechanisms for handling arbitrary sequences of input samples. We show that GridCellNet can reliably perform classification, generalizing to both unseen examples and completely novel sequence trajectories. We further show that inference is often successful after sampling a fraction of the input space, enabling the predictive GridCellNet to reconstruct the rest of the image given just a few movements. We propose that dynamically moving agents with active sensors can use grid cell representations not only for navigation, but also for efficient recognition and feature prediction of seen objects.

READ FULL TEXT
research
03/21/2018

Emergence of grid-like representations by training recurrent neural networks to perform spatial localization

Decades of research on the neural code underlying spatial navigation hav...
research
09/07/2021

Capturing the objects of vision with neural networks

Human visual perception carves a scene at its physical joints, decomposi...
research
09/10/2017

Fully Convolutional Neural Networks for Dynamic Object Detection in Grid Maps

Grid maps are widely used in robotics to represent obstacles in the envi...
research
08/12/2018

Scene-LSTM: A Model for Human Trajectory Prediction

We develop a human movement trajectory prediction system that incorporat...
research
08/23/2023

Characterising representation dynamics in recurrent neural networks for object recognition

Recurrent neural networks (RNNs) have yielded promising results for both...
research
09/23/2021

Lifelong 3D Object Recognition and Grasp Synthesis Using Dual Memory Recurrent Self-Organization Networks

Humans learn to recognize and manipulate new objects in lifelong setting...

Please sign up or login with your details

Forgot password? Click here to reset