Embedded-State Latent Conditional Random Fields for Sequence Labeling

09/28/2018
by   Dung Thai, et al.
2

Complex textual information extraction tasks are often posed as sequence labeling or shallow parsing, where fields are extracted using local labels made consistent through probabilistic inference in a graphical model with constrained transitions. Recently, it has become common to locally parametrize these models using rich features extracted by recurrent neural networks (such as LSTM), while enforcing consistent outputs through a simple linear-chain model, representing Markovian dependencies between successive labels. However, the simple graphical model structure belies the often complex non-local constraints between output labels. For example, many fields, such as a first name, can only occur a fixed number of times, or in the presence of other fields. While RNNs have provided increasingly powerful context-aware local features for sequence tagging, they have yet to be integrated with a global graphical model of similar expressivity in the output distribution. Our model goes beyond the linear chain CRF to incorporate multiple hidden states per output label, but parametrizes their transitions parsimoniously with low-rank log-potential scoring matrices, effectively learning an embedding space for hidden states. This augmented latent space of inference variables complements the rich feature representation of the RNN, and allows exact global inference obeying complex, learned non-local output constraints. We experiment with several datasets and show that the model outperforms baseline CRF+RNN models when global output constraints are necessary at inference-time, and explore the interpretable latent structure.

READ FULL TEXT

page 2

page 7

page 8

research
08/02/2017

Low-Rank Hidden State Embeddings for Viterbi Sequence Labeling

In textual information extraction and other sequence labeling tasks it i...
research
06/14/2021

Constraining Linear-chain CRFs to Regular Languages

In structured prediction, a major challenge for models is to represent t...
research
11/04/2018

Neural CRF transducers for sequence labeling

Conditional random fields (CRFs) have been shown to be one of the most s...
research
11/09/2019

Factored Latent-Dynamic Conditional Random Fields for Single and Multi-label Sequence Modeling

Conditional Random Fields (CRF) are frequently applied for labeling and ...
research
07/06/2018

Sum-Product Networks for Sequence Labeling

We consider higher-order linear-chain conditional random fields (HO-LC-C...
research
10/31/2018

GraphIE: A Graph-Based Framework for Information Extraction

Most modern Information Extraction (IE) systems are implemented as seque...
research
03/04/2015

Bethe Projections for Non-Local Inference

Many inference problems in structured prediction are naturally solved by...

Please sign up or login with your details

Forgot password? Click here to reset