Discontinuous Grammar as a Foreign Language

In order to achieve deep natural language understanding, syntactic constituent parsing is a vital step, highly demanded by many artificial intelligence systems to process both text and speech. One of the most recent proposals is the use of standard sequence-to-sequence models to perform constituent parsing as a machine translation task, instead of applying task-specific parsers. While they show a competitive performance, these text-to-parse transducers are still lagging behind classic techniques in terms of accuracy, coverage and speed. To close the gap, we here extend the framework of sequence-to-sequence models for constituent parsing, not only by providing a more powerful neural architecture for improving their performance, but also by enlarging their coverage to handle the most complex syntactic phenomena: discontinuous structures. To that end, we design several novel linearizations that can fully produce discontinuities and, for the first time, we test a sequence-to-sequence model on the main discontinuous benchmarks, obtaining competitive results on par with task-specific discontinuous constituent parsers and achieving state-of-the-art scores on the (discontinuous) English Penn Treebank.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/05/2020

Discontinuous Constituent Parsing with Pointer Networks

One of the most complex syntactic representations used in computational ...
research
12/23/2014

Grammar as a Foreign Language

Syntactic constituency parsing is a fundamental problem in natural langu...
research
04/13/2021

Reducing Discontinuous to Continuous Parsing with Pointer Network Reordering

Discontinuous constituent parsers have always lagged behind continuous a...
research
05/27/2020

Enriched In-Order Linearization for Faster Sequence-to-Sequence Constituent Parsing

Sequence-to-sequence constituent parsing requires a linearization to rep...
research
08/14/2018

Two Local Models for Neural Constituent Parsing

Non-local features have been exploited by syntactic parsers for capturin...
research
09/21/2020

Multitask Pointer Network for Multi-Representational Parsing

We propose a transition-based approach that, by training a single model,...
research
10/07/2019

Controllable Sentence Simplification

Text simplification aims at making a text easier to read and understand ...

Please sign up or login with your details

Forgot password? Click here to reset