A Generalized Language Model in Tensor Space

01/31/2019
by Lipeng Zhang, et al.

Tensors have been used effectively to capture context information in language models. However, existing methods usually adopt relatively low-order tensors, which have limited expressive power for modeling language. Developing a higher-order tensor representation is challenging, both in deriving an effective solution and in showing its generality. In this paper, we propose the Tensor Space Language Model (TSLM), which is built on tensor networks and tensor decomposition. In TSLM, we construct a high-dimensional semantic space from the tensor product of word vectors. Theoretically, we prove that this tensor representation is a generalization of the n-gram language model. We further show that the high-order tensor representation can be decomposed into a recursive calculation of conditional probability for language modeling. Experimental results on the Penn Tree Bank (PTB) dataset and the WikiText benchmark demonstrate the effectiveness of TSLM.
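The link between a tensor product of word vectors and an n-gram model can be illustrated with a small sketch. It is not the TSLM construction itself: it assumes one-hot word vectors, a toy corpus, and hypothetical helper names (`one_hot`, `tensor_product`, `corpus_tensor`, `conditional_probability`) chosen only for this illustration. Under those assumptions, the tensor product of n one-hot vectors has a single nonzero entry, so summing the products over a corpus yields the n-gram count tensor, and normalizing over the last index gives the usual n-gram conditional probability.

```python
from collections import defaultdict
from itertools import product


def one_hot(index, size):
    """One-hot vector of length `size` with a 1.0 at position `index`."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def tensor_product(vectors):
    """Outer product of the given vectors, stored sparsely.

    Returns a dict mapping an index tuple (i1, ..., in) to the entry
    v1[i1] * ... * vn[in]; zero entries are omitted.
    """
    entries = {}
    for idx in product(*(range(len(v)) for v in vectors)):
        value = 1.0
        for v, i in zip(vectors, idx):
            value *= v[i]
        if value != 0.0:
            entries[idx] = value
    return entries


def corpus_tensor(tokens, vocab, n):
    """Sum the order-n tensor products of one-hot vectors over all n-grams."""
    word_id = {w: i for i, w in enumerate(vocab)}
    total = defaultdict(float)
    for start in range(len(tokens) - n + 1):
        window = tokens[start:start + n]
        vectors = [one_hot(word_id[w], len(vocab)) for w in window]
        for idx, value in tensor_product(vectors).items():
            total[idx] += value
    return total, word_id


def conditional_probability(tensor, word_id, context, word):
    """p(word | context): normalize the tensor over its last index."""
    ctx = tuple(word_id[w] for w in context)
    numerator = tensor.get(ctx + (word_id[word],), 0.0)
    denominator = sum(tensor.get(ctx + (j,), 0.0) for j in range(len(word_id)))
    return numerator / denominator if denominator else 0.0


# Toy corpus (an assumption for illustration, not data from the paper).
tokens = "the cat sat on the mat the cat ran".split()
vocab = sorted(set(tokens))
tensor, word_id = corpus_tensor(tokens, vocab, n=2)

# "the" is followed by "cat" twice and "mat" once, so this is 2/3.
p = conditional_probability(tensor, word_id, ["the"], "cat")
print(p)
```

With dense, learned word vectors in place of one-hot vectors, the same product no longer has a single nonzero entry, which is the sense in which a tensor representation can be more general than n-gram counts. The dense enumeration here grows as |V|^n, so it is only practical for toy vocabularies.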


research
11/12/2021

Robust Eigenvectors of Symmetric Tensors

The tensor power method generalizes the matrix power method to higher or...
research
06/01/2023

Faster Robust Tensor Power Method for Arbitrary Order

Tensor decomposition is a fundamental method used in various areas to de...
research
12/09/2022

Decomposable Sparse Tensor on Tensor Regression

Most regularized tensor regression research focuses on tensors predictor...
research
03/02/2020

Tensor Networks for Language Modeling

The tensor network formalism has enjoyed over two decades of success in ...
research
10/27/2017

Tensor network language model

We propose a new statistical model suitable for machine learning of syst...
research
08/16/2016

Fast, Small and Exact: Infinite-order Language Modelling with Compressed Suffix Trees

Efficient methods for storing and querying are critical for scaling high...
research
07/24/2016

Latent Tree Language Model

In this paper we introduce Latent Tree Language Model (LTLM), a novel ap...
