Autoregressive Structured Prediction with Language Models

10/26/2022
by   Tianyu Liu, et al.
0

Recent years have seen a paradigm shift in NLP towards using pretrained language models (PLM) for a wide range of tasks. However, there are many difficult design decisions to represent structures (e.g. tagged text, coreference chains) in a way such that they can be captured by PLMs. Prior work on structured prediction with PLMs typically flattens the structured output into a sequence, which limits the quality of structural information being learned and leads to inferior performance compared to classic discriminative models. In this work, we describe an approach to model structures as sequences of actions in an autoregressive manner with PLMs, allowing in-structure dependencies to be learned without any loss. Our approach achieves the new state-of-the-art on all the structured prediction tasks we looked at, namely, named entity recognition, end-to-end relation extraction, and coreference resolution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2022

Prompting Language Models for Linguistic Structure

Although pretrained language models (PLMs) can be prompted to perform a ...
research
05/21/2022

DeepStruct: Pretraining of Language Models for Structure Prediction

We introduce a method for improving the structural understanding abiliti...
research
12/20/2019

End-to-end Named Entity Recognition and Relation Extraction using Pre-trained Language Models

Named entity recognition (NER) and relation extraction (RE) are two impo...
research
07/28/2022

Efficient Training of Language Models to Fill in the Middle

We show that autoregressive language models can learn to infill text aft...
research
01/14/2021

Structured Prediction as Translation between Augmented Natural Languages

We propose a new framework, Translation between Augmented Natural Langua...
research
06/15/2022

Contextualization and Generalization in Entity and Relation Extraction

During the past decade, neural networks have become prominent in Natural...
research
05/08/2023

Revisiting Relation Extraction in the era of Large Language Models

Relation extraction (RE) is the core NLP task of inferring semantic rela...

Please sign up or login with your details

Forgot password? Click here to reset