TGIF: Tree-Graph Integrated-Format Parser for Enhanced UD with Two-Stage Generic- to Individual-Language Finetuning

07/14/2021
by   Tianze Shi, et al.
0

We present our contribution to the IWPT 2021 shared task on parsing into enhanced Universal Dependencies. Our main system component is a hybrid tree-graph parser that integrates (a) predictions of spanning trees for the enhanced graphs with (b) additional graph edges not present in the spanning trees. We also adopt a finetuning strategy where we first train a language-generic parser on the concatenation of data from all available languages, and then, in a second step, finetune on each individual language separately. Additionally, we develop our own complete set of pre-processing modules relevant to the shared task, including tokenization, sentence segmentation, and multiword token expansion, based on pre-trained XLM-R models and our own pre-training of character-level language models. Our submission reaches a macro-average ELAS of 89.24 on the test set. It ranks top among all teams, with a margin of more than 2 absolute ELAS over the next best-performing submission, and best score on 16 out of 17 languages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/05/2021

The DCU-EPFL Enhanced Dependency Parser at the IWPT 2021 Shared Task

We describe the DCU-EPFL submission to the IWPT 2021 Shared Task on Pars...
research
09/03/2020

The ADAPT Enhanced Dependency Parser at the IWPT 2020 Shared Task

We describe the ADAPT system for the 2020 IWPT Shared Task on parsing en...
research
10/23/2020

Graph-Based Universal Dependency Parsing in the Age of the Transformer: What Works, and What Doesn't

Current state-of-the-art graph-based dependency parsers differ on variou...
research
06/24/2021

Splitting EUD graphs into trees: A quick and clatty approach

We present the system submission from the FASTPARSE team for the EUD Sha...
research
11/07/2018

IMS at the PolEval 2018: A Bulky Ensemble Depedency Parser meets 12 Simple Rules for Predicting Enhanced Dependencies in Polish

This paper presents the IMS contribution to the PolEval 2018 Shared Task...
research
07/08/2021

COMBO: a new module for EUD parsing

We introduce the COMBO-based approach for EUD parsing and its implementa...
research
11/05/2020

Fast XML/HTML for Haskell: XML TypeLift

The paper presents and compares a range of parsers with and without data...

Please sign up or login with your details

Forgot password? Click here to reset