Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n^6) down to O(n^3)

03/30/2020
by   Caio Corro, et al.
0

We introduce a novel chart-based algorithm for span-based parsing of discontinuous constituency trees of block degree two, including ill-nested structures. In particular, we show that we can build variants of our parser with smaller search spaces and time complexities ranging from O(n^6) down to O(n^3). The cubic time variant covers 98% of constituents observed in linguistic treebanks while having the same complexity as continuous constituency parsers. We evaluate our approach on German and English treebanks (Negra, Tiger and Discontinuous PTB) and report state-of-the-art results in the fully supervised setting. We also experiment with pre-trained word embeddings and -based neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/01/2019

Discontinuous Constituency Parsing with a Stack-Free Transition System and a Dynamic Oracle

We introduce a novel transition system for discontinuous constituency pa...
research
02/24/2019

Unlexicalized Transition-based Discontinuous Constituency Parsing

Lexicalized parsing models are based on the assumptions that (i) constit...
research
10/20/2020

Supertagging-based Parsing with Linear Context-free Rewriting Systems

We present the first supertagging-based parser for LCFRS. It utilizes ne...
research
04/13/2021

Reducing Discontinuous to Continuous Parsing with Pointer Network Reordering

Discontinuous constituent parsers have always lagged behind continuous a...
research
09/21/2020

Multitask Pointer Network for Multi-Representational Parsing

We propose a transition-based approach that, by training a single model,...
research
03/11/2019

HLT@SUDA at SemEval 2019 Task 1: UCCA Graph Parsing as Constituent Tree Parsing

This paper describes a simple UCCA semantic graph parsing approach. The ...

Please sign up or login with your details

Forgot password? Click here to reset