An Exploration of Arbitrary-Order Sequence Labeling via Energy-Based Inference Networks

10/06/2020
by Lifu Tu, et al.

Many tasks in natural language processing involve predicting structured outputs, e.g., sequence labeling, semantic role labeling, parsing, and machine translation. Researchers are increasingly applying deep representation learning to these problems, but the structured component of these approaches is usually quite simplistic. In this work, we propose several high-order energy terms to capture complex dependencies among labels in sequence labeling, including several that consider the entire label sequence. We use neural parameterizations for these energy terms, drawing from convolutional, recurrent, and self-attention networks. We use the framework of learning energy-based inference networks (Tu and Gimpel, 2018) for dealing with the difficulties of training and inference with such models. We empirically demonstrate that this approach achieves substantial improvement using a variety of high-order energy terms on four sequence labeling tasks, while having the same decoding speed as simple, local classifiers. We also find high-order energies to help in noisy data conditions.
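
To make the idea of an energy over a whole label sequence concrete, here is a minimal NumPy sketch. It is not the authors' implementation. It assumes a first-order (pairwise) energy as the simplest stand-in for the high-order terms described above, and it uses a per-position softmax over the local scores as a placeholder for a trained inference network. The names `chain_energy`, `decode`, `unary`, and `trans` are hypothetical.

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable softmax along the given axis.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def chain_energy(unary, trans, y):
    """Energy of a (possibly relaxed) label sequence; lower is better.

    unary: (T, L) per-position label scores (assumed given by some encoder)
    trans: (L, L) pairwise label compatibility matrix (assumed parameter)
    y:     (T, L) rows on the probability simplex (one-hot or soft)
    """
    local = np.sum(unary * y)                 # sum_t  y_t^T u_t
    pair = np.sum((y[:-1] @ trans) * y[1:])   # sum_t  y_{t-1}^T W y_t
    return -(local + pair)

def decode(y):
    # Position-wise argmax: the same cost as decoding a local classifier.
    return y.argmax(axis=-1)

rng = np.random.default_rng(0)
T, L = 5, 3                                   # toy sequence length and label count
unary = rng.normal(size=(T, L))
trans = rng.normal(size=(L, L))

# Placeholder for an inference network's output: one distribution per position.
y_soft = softmax(unary)
labels = decode(y_soft)
y_hard = np.eye(L)[labels]                    # one-hot version of the decoded labels

e_soft = chain_energy(unary, trans, y_soft)
e_hard = chain_energy(unary, trans, y_hard)
print("decoded labels:", labels)
print("energy (relaxed):", e_soft, "energy (discrete):", e_hard)
```

Because the energy is defined on relaxed label distributions, a network that outputs `y_soft` can in principle be trained by gradient-based methods to produce low-energy outputs, and test-time decoding reduces to an argmax per position. A higher-order term would replace or supplement `pair` with a function of longer label windows or of the entire sequence.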

research
11/10/2020

Neural Latent Dependency Model for Sequence Labeling

Sequence labeling is a fundamental problem in machine learning, natural ...
research
11/22/2017

Does Higher Order LSTM Have Better Accuracy in Chunking and Named Entity Recognition?

Current researches usually employ single order setting by default when d...
research
08/27/2021

Learning Energy-Based Approximate Inference Networks for Structured Applications in NLP

Structured prediction in natural language processing (NLP) has a long hi...
research
10/31/2019

Graph Structured Prediction Energy Networks

For joint inference over multiple variables, a variety of structured pre...
research
12/25/2014

Gaussian Process Pseudo-Likelihood Models for Sequence Labeling

Several machine learning problems arising in natural language processing...
research
06/10/2019

Label-Agnostic Sequence Labeling by Copying Nearest Neighbors

Retrieve-and-edit based approaches to structured prediction, where struc...
research
07/05/2015

Parsimonious Labeling

We propose a new family of discrete energy minimization problems, which ...
