A Linear Dynamical System Model for Text

02/13/2015
by   David Belanger, et al.
0

Low dimensional representations of words allow accurate NLP models to be trained on limited annotated data. While most representations ignore words' local context, a natural way to induce context-dependent representations is to perform inference in a probabilistic latent-variable sequence model. Given the recent success of continuous vector space word representations, we provide such an inference procedure for continuous states, where words' representations are given by the posterior mean of a linear dynamical system. Here, efficient inference can be performed using Kalman filtering. Our learning algorithm is extremely scalable, operating on simple cooccurrence counts for both parameter initialization using the method of moments and subsequent iterations of EM. In our experiments, we employ our inferred word embeddings as features in standard tagging tasks, obtaining significant accuracy improvements. Finally, the Kalman filter updates can be seen as a linear recurrent neural network. We demonstrate that using the parameters of our model to initialize a non-linear recurrent neural network language model reduces its training time by a day and yields lower perplexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/23/2023

System Identification for Continuous-time Linear Dynamical Systems

The problem of system identification for the Kalman filter, relying on t...
research
07/21/2021

KalmanNet: Neural Network Aided Kalman Filtering for Partially Known Dynamics

Real-time state estimation of dynamical systems is a fundamental task in...
research
07/04/2012

Predictive Linear-Gaussian Models of Stochastic Dynamical Systems

Models of dynamical systems based on predictive state representations (P...
research
03/07/2016

A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

This paper presents a novel latent variable recurrent neural network arc...
research
02/27/2017

Dynamic Word Embeddings

We present a probabilistic language model for time-stamped text data whi...
research
02/17/2017

Estimating Nonlinear Dynamics with the ConvNet Smoother

Estimating the state of a dynamical system from a series of noise-corrup...
research
02/14/2017

Practical Learning of Predictive State Representations

Over the past decade there has been considerable interest in spectral al...

Please sign up or login with your details

Forgot password? Click here to reset