GeoSeq2Seq: Information Geometric Sequence-to-Sequence Networks

10/25/2017
by   Alessandro Bay, et al.
0

The Fisher information metric is an important foundation of information geometry, wherein it allows us to approximate the local geometry of a probability distribution. Recurrent neural networks such as the Sequence-to-Sequence (Seq2Seq) networks that have lately been used to yield state-of-the-art performance on speech translation or image captioning have so far ignored the geometry of the latent embedding, that they iteratively learn. We propose the information geometric Seq2Seq network which abridges the gap between deep recurrent neural networks and information geometry. Specifically, the latent embedding offered by a recurrent network is encoded as a Fisher kernel of a parametric Gaussian Mixture Model, a formalism common in computer vision. We utilise such a network to predict the shortest routes between two nodes of a graph by learning the adjacency matrix using the information geometric Seq2Seq model; our results show that for such a problem the probabilistic representation of the latent embedding supersedes the non-probabilistic embedding by 10-15

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/24/2018

Approximate Distribution Matching for Sequence-to-Sequence Learning

Sequence-to-Sequence models were introduced to tackle many real-life pro...
research
11/09/2018

AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms

This paper describes a method based on a sequence-to-sequence learning (...
research
10/11/2017

Sequence stacking using dual encoder Seq2Seq recurrent networks

A widely studied non-polynomial (NP) hard problem lies in finding a rout...
research
10/11/2017

StackSeq2Seq: Dual Encoder Seq2Seq Recurrent Networks

A widely studied non-deterministic polynomial time (NP) hard problem lie...
research
09/11/2015

Hessian-free Optimization for Learning Deep Multidimensional Recurrent Neural Networks

Multidimensional recurrent neural networks (MDRNNs) have shown a remarka...
research
03/11/2016

Determination of the edge of criticality in echo state networks through Fisher information maximization

It is a widely accepted fact that the computational capability of recurr...
research
06/03/2020

Classifying histograms of medical data using information geometry of beta distributions

In this paper, we use tools of information geometry to compare, average ...

Please sign up or login with your details

Forgot password? Click here to reset