Structured Prediction as Translation between Augmented Natural Languages

01/14/2021
by   Giovanni Paolini, et al.

We propose a new framework, Translation between Augmented Natural Languages (TANL), to solve many structured prediction language tasks including joint entity and relation extraction, nested named entity recognition, relation classification, semantic role labeling, event extraction, coreference resolution, and dialogue state tracking. Instead of tackling the problem by training task-specific discriminative classifiers, we frame it as a translation task between augmented natural languages, from which the task-relevant information can be easily extracted. Our approach can match or outperform task-specific models on all tasks, and in particular, achieves new state-of-the-art results on joint entity and relation extraction (CoNLL04, ADE, NYT, and ACE2005 datasets), relation classification (FewRel and TACRED), and semantic role labeling (CoNLL-2005 and CoNLL-2012). We accomplish this while using the same architecture and hyperparameters for all tasks and even when training a single model to solve all tasks at the same time (multi-task learning). Finally, we show that our framework can also significantly improve the performance in a low-resource regime, thanks to better use of label semantics.
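To make the phrase "from which the task-relevant information can be easily extracted" concrete, here is a minimal sketch of a decoding step for joint entity and relation extraction. The abstract does not specify the augmented format, so the bracket syntax `[ mention | type | relation = target ]`, the function name `decode`, and the example sentence are assumptions chosen for illustration, not the authors' exact implementation.

```python
import re

# Assumed (hypothetical) annotation syntax:
#   [ mention | entity type | relation = target | ... ]
SPAN = re.compile(r"\[\s*([^\[\]|]+?)\s*\|\s*([^\[\]]+?)\s*\]")


def decode(augmented: str):
    """Recover plain text, entities, and relations from an augmented sentence."""
    entities = []     # (mention, entity_type)
    relations = []    # (head mention, relation name, tail mention)
    plain_parts = []  # pieces of the sentence with annotations stripped
    pos = 0
    for match in SPAN.finditer(augmented):
        # Copy the untouched text that precedes this annotated span.
        plain_parts.append(augmented[pos:match.start()])
        mention = match.group(1)
        fields = [field.strip() for field in match.group(2).split("|")]
        entities.append((mention, fields[0]))
        # Any remaining fields are read as "relation = target" pairs.
        for field in fields[1:]:
            name, _, tail = field.partition("=")
            relations.append((mention, name.strip(), tail.strip()))
        plain_parts.append(mention)
        pos = match.end()
    plain_parts.append(augmented[pos:])
    return "".join(plain_parts), entities, relations


example = (
    "[ Tolkien | person ]'s novel "
    "[ The Lord of the Rings | book | author = Tolkien ] was published in 1954."
)
text, entities, relations = decode(example)
print(text)
print(entities)
print(relations)
```

In this sketch the model's only job is to emit the augmented sentence; the structured output is recovered by ordinary string parsing. Because the entity and relation types appear as natural-language words in the target text, a setup like this is one way label semantics could be exposed to the model.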


