Low-Rank Hidden State Embeddings for Viterbi Sequence Labeling

08/02/2017
by   Dung Thai, et al.
0

In textual information extraction and other sequence labeling tasks it is now common to use recurrent neural networks (such as LSTM) to form rich embedded representations of long-term input co-occurrence patterns. Representation of output co-occurrence patterns is typically limited to a hand-designed graphical model, such as a linear-chain CRF representing short-term Markov dependencies among successive labels. This paper presents a method that learns embedded representations of latent output structure in sequence data. Our model takes the form of a finite-state machine with a large number of latent states per label (a latent variable CRF), where the state-transition matrix is factorized---effectively forming an embedded representation of state-transitions capable of enforcing long-term label dependencies, while supporting exact Viterbi inference over output labels. We demonstrate accuracy improvements and interpretable latent structure in a synthetic but complex task based on CoNLL named entity recognition.

READ FULL TEXT
research
09/28/2018

Embedded-State Latent Conditional Random Fields for Sequence Labeling

Complex textual information extraction tasks are often posed as sequence...
research
08/01/2016

Structured prediction models for RNN based sequence labeling in clinical text

Sequence labeling is a widely used method for named entity recognition a...
research
12/19/2020

Uncertainty-Aware Label Refinement for Sequence Labeling

Conditional random fields (CRF) for label decoding has become ubiquitous...
research
08/23/2019

Hierarchically-Refined Label Attention Network for Sequence Labeling

CRF has been used as a powerful model for statistical sequence labeling....
research
08/24/2019

Enhancing Neural Sequence Labeling with Position-Aware Self-Attention

Sequence labeling is a fundamental task in natural language processing a...
research
11/09/2019

Factored Latent-Dynamic Conditional Random Fields for Single and Multi-label Sequence Modeling

Conditional Random Fields (CRF) are frequently applied for labeling and ...
research
11/11/2020

An Investigation of Potential Function Designs for Neural CRF

The neural linear-chain CRF model is one of the most widely-used approac...

Please sign up or login with your details

Forgot password? Click here to reset