UDPipe at SIGMORPHON 2019: Contextualized Embeddings, Regularization with Morphological Categories, Corpora Merging

08/19/2019
by Milan Straka, et al.

We present our contribution to the SIGMORPHON 2019 Shared Task: Crosslinguality and Context in Morphology, Task 2: contextual morphological analysis and lemmatization. We submitted a modification of UDPipe 2.0, one of the best-performing systems of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies and the overall winner of the 2018 Shared Task on Extrinsic Parser Evaluation. We make three improvements: first, we use pretrained contextualized embeddings (BERT) as additional inputs to the network; second, we use individual morphological features as regularization; and third, we merge selected corpora of the same language. In the lemmatization task, our system outperforms all other submitted systems by a wide margin, with a lemmatization accuracy of 95.78 (the second best was 95.00, the third 94.46). In morphological analysis, our system placed a close second: its accuracy was 93.19, against the winning system's 93.23.
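To make the second improvement more concrete, the following is a minimal sketch of how auxiliary targets for individual morphological features could be derived from whole tags. It assumes tags are strings of features joined by `;` (for example `N;NOM;SG`); the function names `split_tag`, `build_vocabularies`, and `multi_hot` are hypothetical and are not taken from UDPipe, and the multi-hot encoding is a simplification that may differ from the scheme the authors actually use.

```python
def split_tag(tag, sep=";"):
    """Split a whole morphological tag into its individual features.

    Assumption: features are joined by `sep`; empty pieces are dropped.
    """
    return [feature for feature in tag.split(sep) if feature]


def build_vocabularies(tags):
    """Return (sorted whole-tag vocabulary, sorted individual-feature vocabulary)."""
    tag_vocab = sorted(set(tags))
    feature_vocab = sorted({f for tag in tags for f in split_tag(tag)})
    return tag_vocab, feature_vocab


def multi_hot(tag, feature_vocab):
    """Encode a tag as a 0/1 vector over the feature vocabulary.

    Such a vector could serve as an auxiliary training target next to
    the whole-tag class, so that rare tags share signal through the
    features they have in common with frequent tags.
    """
    present = set(split_tag(tag))
    return [1 if feature in present else 0 for feature in feature_vocab]


# Toy data, invented for illustration only.
example_tags = ["N;NOM;SG", "N;ACC;PL", "V;PST"]
tag_vocab, feature_vocab = build_vocabularies(example_tags)
aux_target = multi_hot("N;NOM;SG", feature_vocab)
```

In a tagger, the main classifier would still predict the whole tag, while an additional loss on targets like `aux_target` would act only as a regularizer during training.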


research
09/05/2018

Copenhagen at CoNLL--SIGMORPHON 2018: Multilingual Inflection in Context with Explicit Morphosyntactic Decoding

This paper documents the Team Copenhagen system which placed first in th...
research
08/20/2019

Evaluating Contextualized Embeddings on 54 Languages in POS Tagging, Lemmatization and Dependency Parsing

We present an extensive evaluation of three recently proposed methods fo...
research
01/30/2020

LowResourceEval-2019: a shared task on morphological analysis for low-resource languages

The paper describes the results of the first shared task on morphologica...
research
06/05/2020

UDPipe at EvaLatin 2020: Contextualized Embeddings and Treebank Embeddings

We present our contribution to the EvaLatin shared task, which is the fi...
research
09/15/2018

Finding the way from ä to a: Sub-character morphological inflection for the SIGMORPHON 2018 Shared Task

In this paper we describe the system submitted by UHH to the CoNLL--SIGM...
research
10/16/2018

The CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection

The CoNLL--SIGMORPHON 2018 shared task on supervised learning of morphol...
research
06/09/2023

Morphosyntactic probing of multilingual BERT models

We introduce an extensive dataset for multilingual probing of morphologi...
