DeepAI AI Chat
Log In Sign Up

Active Predictive Coding Networks: A Neural Solution to the Problem of Learning Reference Frames and Part-Whole Hierarchies

01/14/2022
by   Dimitrios C. Gklezakos, et al.
University of Washington
0

We introduce Active Predictive Coding Networks (APCNs), a new class of neural networks that solve a major problem posed by Hinton and others in the fields of artificial intelligence and brain modeling: how can neural networks learn intrinsic reference frames for objects and parse visual scenes into part-whole hierarchies by dynamically allocating nodes in a parse tree? APCNs address this problem by using a novel combination of ideas: (1) hypernetworks are used for dynamically generating recurrent neural networks that predict parts and their locations within intrinsic reference frames conditioned on higher object-level embedding vectors, and (2) reinforcement learning is used in conjunction with backpropagation for end-to-end learning of model parameters. The APCN architecture lends itself naturally to multi-level hierarchical learning and is closely related to predictive coding models of cortical function. Using the MNIST, Fashion-MNIST and Omniglot datasets, we demonstrate that APCNs can (a) learn to parse images into part-whole hierarchies, (b) learn compositional representations, and (c) transfer their knowledge to unseen classes of objects. With their ability to dynamically generate parse trees with part locations for objects, APCNs offer a new framework for explainable AI that leverages advances in deep learning while retaining interpretability and compositionality.

READ FULL TEXT

page 5

page 10

page 12

page 13

page 14

06/16/2022

Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies

Human vision involves parsing and representing objects and scenes using ...
10/31/2016

Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All Networks

We propose a convolutional recurrent neural network, with Winner-Take-Al...
12/05/2018

HEVC Inter Coding Using Deep Recurrent Neural Networks and Artificial Reference Pictures

The efficiency of motion compensated prediction in modern video codecs h...
11/19/2015

Unsupervised Learning of Visual Structure using Predictive Generative Networks

The ability to predict future states of the environment is a central pil...
08/03/2016

Learning a Driving Simulator

Comma.ai's approach to Artificial Intelligence for self-driving cars is ...
01/27/2021

Self-Calibrating Active Binocular Vision via Active Efficient Coding with Deep Autoencoders

We present a model of the self-calibration of active binocular vision co...