Not All Linearizations Are Equally Data-Hungry in Sequence Labeling Parsing

08/17/2021
by   Alberto Muñoz-Ortiz, et al.
0

Different linearizations have been proposed to cast dependency parsing as sequence labeling and solve the task as: (i) a head selection problem, (ii) finding a representation of the token arcs as bracket strings, or (iii) associating partial transition sequences of a transition-based parser to words. Yet, there is little understanding about how these linearizations behave in low-resource setups. Here, we first study their data efficiency, simulating data-restricted setups from a diverse set of rich-resource treebanks. Second, we test whether such differences manifest in truly low-resource setups. The results show that head selection encodings are more data-efficient and perform better in an ideal (gold) framework, but that such advantage greatly vanishes in favour of bracketing formats when the running setup resembles a real-world low-resource configuration.

READ FULL TEXT

page 6

page 7

research
10/27/2022

Parsing linearizations appreciate PoS tags - but some are fussy about errors

PoS tags, once taken for granted as a useful resource for syntactic pars...
research
02/12/2021

A Little Pretraining Goes a Long Way: A Case Study on Dependency Parsing Task for Low-resource Morphologically Rich Languages

Neural dependency parsing has achieved remarkable performance for many d...
research
11/01/2020

A Unifying Theory of Transition-based and Sequence Labeling Parsing

We define a mapping from transition-based parsing algorithms that read s...
research
11/11/2019

Deep Contextualized Self-training for Low Resource Dependency Parsing

Neural dependency parsing has proven very effective, achieving state-of-...
research
11/12/2020

Exploiting Cross-Dialectal Gold Syntax for Low-Resource Historical Languages: Towards a Generic Parser for Pre-Modern Slavic

This paper explores the possibility of improving the performance of spec...
research
09/29/2022

Generate-and-Retrieve: use your predictions to improve retrieval for semantic parsing

A common recent approach to semantic parsing augments sequence-to-sequen...
research
06/26/2019

Eliciting Knowledge from Experts:Automatic Transcript Parsing for Cognitive Task Analysis

Cognitive task analysis (CTA) is a type of analysis in applied psycholog...

Please sign up or login with your details

Forgot password? Click here to reset