Tensor Networks for Language Modeling

03/02/2020
by   Jacob Miller, et al.
0

The tensor network formalism has enjoyed over two decades of success in modeling the behavior of complex quantum-mechanical systems, but has only recently and sporadically been leveraged in machine learning. Here we introduce a uniform matrix product state (u-MPS) model for probabilistic modeling of sequence data. We identify several distinctive features of this recurrent generative model, notably the ability to condition or marginalize sampling on characters at arbitrary locations within a sequence, with no need for approximate sampling methods. Despite the sequential architecture of u-MPS, we show that a recursive evaluation algorithm can be used to parallelize its inference and training, with a string of length n only requiring parallel time O(log n) to evaluate. Experiments on a context-free language demonstrate a strong capacity to learn grammatical structure from limited data, pointing towards the potential of tensor networks for language modeling applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2016

Recurrent Neural Network Grammars

We introduce recurrent neural network grammars, probabilistic models of ...
research
08/20/2017

Neural Networks Compression for Language Modeling

In this paper, we consider several compression techniques for the langua...
research
01/31/2019

A Generalized Language Model in Tensor Space

In the literature, tensors have been effectively used for capturing the ...
research
01/08/2019

Tree Tensor Networks for Generative Modeling

Matrix product states (MPS), a tensor network designed for one-dimension...
research
10/27/2016

Professor Forcing: A New Algorithm for Training Recurrent Networks

The Teacher Forcing algorithm trains recurrent networks by supplying obs...
research
10/15/2018

Trellis Networks for Sequence Modeling

We present trellis networks, a new architecture for sequence modeling. O...
research
08/28/2018

A Quantum Many-body Wave Function Inspired Language Modeling Approach

The recently proposed quantum language model (QLM) aimed at a principled...

Please sign up or login with your details

Forgot password? Click here to reset