Towards Instance-Level Parser Selection for Cross-Lingual Transfer of Dependency Parsers

04/16/2020
by   Robert Litschko, et al.
0

Current methods of cross-lingual parser transfer focus on predicting the best parser for a low-resource target language globally, that is, "at treebank level". In this work, we propose and argue for a novel cross-lingual transfer paradigm: instance-level parser selection (ILPS), and present a proof-of-concept study focused on instance-level selection in the framework of delexicalized parser transfer. We start from an empirical observation that different source parsers are the best choice for different Universal POS sequences in the target language. We then propose to predict the best parser at the instance level. To this end, we train a supervised regression model, based on the Transformer architecture, to predict parser accuracies for individual POS-sequences. We compare ILPS against two strong single-best parser selection baselines (SBPS): (1) a model that compares POS n-gram distributions between the source and target languages (KL) and (2) a model that selects the source based on the similarity between manually created language vectors encoding syntactic properties of languages (L2V). The results from our extensive evaluation, coupling 42 source parsers and 20 diverse low-resource test languages, show that ILPS outperforms KL and L2V on 13/20 and 14/20 test languages, respectively. Further, we show that by predicting the best parser "at the treebank level" (SBPS), using the aggregation of predictions from our instance-level model, we outperform the same baselines on 17/20 and 16/20 test languages.

READ FULL TEXT

page 3

page 4

page 6

research
01/27/2021

PPT: Parsimonious Parser Transfer for Unsupervised Cross-Lingual Adaptation

Cross-lingual transfer is a leading technique for parsing low-resource l...
research
08/09/2023

Cross-Lingual Constituency Parsing for Middle High German: A Delexicalized Approach

Constituency parsing plays a fundamental role in advancing natural langu...
research
11/05/2020

Fast XML/HTML for Haskell: XML TypeLift

The paper presents and compares a range of parsers with and without data...
research
09/20/2019

Cross-lingual Dependency Parsing with Unlabeled Auxiliary Languages

Cross-lingual transfer learning has become an important weapon to battle...
research
02/27/2020

Improving cross-lingual model transfer by chunking

We present a shallow parser guided cross-lingual model transfer approach...
research
10/10/2017

The Galactic Dependencies Treebanks: Getting More Data by Synthesizing New Languages

We release Galactic Dependencies 1.0---a large set of synthetic language...
research
05/24/2022

Universal Dependency Treebank for Odia Language

This paper presents the first publicly available treebank of Odia, a mor...

Please sign up or login with your details

Forgot password? Click here to reset