A single-layer RNN can approximate stacked and bidirectional RNNs, and topologies in between

08/30/2019
by   Javier S. Turek, et al.
1

To enhance the expressiveness and representational capacity of recurrent neural networks (RNN), a large body of work has emerged exploring stacked architectures with additional topological modifications like shortcut connections or bidirectionality. However, choosing the best network for a particular problem requires a combinatorial search over architectures and their hyperparameters. In this work, we show that a single-layer RNN can perfectly mimic an arbitrarily deep stacked RNN under specific constraints on its weight matrix and a delay between input and output. This obviates the need to manually select hyperparameters like the number of layers. Additionally, we show that weakening weight constraints while keeping the delay gives rise to partial acausality in the single-layer RNN, much like a bidirectional network. Synthetic experiments confirm that the delayed RNN can mimic bidirectional networks in perfectly solving some acausal tasks, outperforming them in others. Finally, we show that in a challenging language processing task, the delayed RNN performs within 0.3% of the accuracy of the bidirectional network while reducing computational costs.

READ FULL TEXT

page 7

page 16

research
02/09/2015

Gated Feedback Recurrent Neural Networks

In this work, we propose a novel recurrent neural network (RNN) architec...
research
11/29/2016

Capacity and Trainability in Recurrent Neural Networks

Two potential bottlenecks on the expressiveness of recurrent neural netw...
research
11/06/2017

Neural Speed Reading via Skim-RNN

Inspired by the principles of speed reading, we introduce Skim-RNN, a re...
research
11/08/2017

A New Hybrid-parameter Recurrent Neural Networks for Online Handwritten Chinese Character Recognition

The recurrent neural network (RNN) is appropriate for dealing with tempo...
research
08/03/2019

Sentiment Analysis of Typhoon Related Tweets using Standard and Bidirectional Recurrent Neural Networks

The Philippines is a common ground to natural calamities like typhoons, ...
research
02/03/2018

Densely Connected Bidirectional LSTM with Applications to Sentence Classification

Deep neural networks have recently been shown to achieve highly competit...
research
04/01/2020

Distance and Equivalence between Finite State Machines and Recurrent Neural Networks: Computational results

The need of interpreting Deep Learning (DL) models has led, during the p...

Please sign up or login with your details

Forgot password? Click here to reset