Progress Extrapolating Algorithmic Learning to Arbitrary Sequence Lengths

03/18/2020
by   Andreas Robinson, et al.
0

Recent neural network models for algorithmic tasks have led to significant improvements in extrapolation to sequences much longer than training, but it remains an outstanding problem that the performance still degrades for very long or adversarial sequences. We present alternative architectures and loss-terms to address these issues, and our testing of these approaches has not detected any remaining extrapolation errors within memory constraints. We focus on linear time algorithmic tasks including copy, parentheses parsing, and binary addition. First, activation binning was used to discretize the trained network in order to avoid computational drift from continuous operations, and a binning-based digital loss term was added to encourage discretizable representations. In addition, a localized differentiable memory (LDM) architecture, in contrast to distributed memory access, addressed remaining extrapolation errors and avoided unbounded growth of internal computational states. Previous work has found that algorithmic extrapolation issues can also be alleviated with approaches relying on program traces, but the current effort does not rely on such traces.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2020

Distributed Memory based Self-Supervised Differentiable Neural Computer

A differentiable neural computer (DNC) is a memory augmented neural netw...
research
10/18/2021

State-Space Constraints Improve the Generalization of the Differentiable Neural Computer in some Algorithmic Tasks

Memory-augmented neural networks (MANNs) can solve algorithmic tasks lik...
research
12/07/2016

Neural Turing Machines: Convergence of Copy Tasks

The architecture of neural Turing machines is differentiable end to end ...
research
11/19/2015

Neural Random-Access Machines

In this paper, we propose and investigate a new neural network architect...
research
05/14/2023

Theta sequences as eligibility traces: a biological solution to credit assignment

Credit assignment problems, for example policy evaluation in RL, often r...
research
06/07/2021

On the Skew-Symmetric Binary Sequences and the Merit Factor Problem

The merit factor problem is of practical importance to manifold domains,...
research
05/13/2018

Low-pass Recurrent Neural Networks - A memory architecture for longer-term correlation discovery

Reinforcement learning (RL) agents performing complex tasks must be able...

Please sign up or login with your details

Forgot password? Click here to reset