Out-of-Domain Evaluation of Finnish Dependency Parsing

04/22/2022
by   Jenna Kanerva, et al.
0

The prevailing practice in the academia is to evaluate the model performance on in-domain evaluation data typically set aside from the training corpus. However, in many real world applications the data on which the model is applied may very substantially differ from the characteristics of the training data. In this paper, we focus on Finnish out-of-domain parsing by introducing a novel UD Finnish-OOD out-of-domain treebank including five very distinct data sources (web documents, clinical, online discussions, tweets, and poetry), and a total of 19,382 syntactic words in 2,122 sentences released under the Universal Dependencies framework. Together with the new treebank, we present extensive out-of-domain parsing evaluation utilizing the available section-level information from three different Finnish UD treebanks (TDT, PUD, OOD). Compared to the previously existing treebanks, the new Finnish-OOD is shown include sections more challenging for the general parser, creating an interesting evaluation setting and yielding valuable information for those applying the parser outside of its training domain.

READ FULL TEXT

page 7

page 8

research
01/11/2017

Parsing Universal Dependencies without training

We propose UDP, the first training-free parser for Universal Dependencie...
research
07/16/2021

POS tagging, lemmatization and dependency parsing of West Frisian

We present a lemmatizer/POS-tagger/dependency parser for West Frisian us...
research
02/24/2020

Resources for Turkish Dependency Parsing: Introducing the BOUN Treebank and the BoAT Annotation Tool

In this paper, we describe our contributions and efforts to develop Turk...
research
09/28/2022

Data-driven Parsing Evaluation for Child-Parent Interactions

We present a syntactic dependency treebank for naturalistic child and ch...
research
01/25/2023

Weakly Supervised Headline Dependency Parsing

English news headlines form a register with unique syntactic properties ...
research
10/13/2021

Compositional Generalization in Dependency Parsing

Compositionality, or the ability to combine familiar units like words in...
research
05/13/2023

Morpheus: Automated Safety Verification of Data-dependent Parser Combinator Programs

Parser combinators are a well-known mechanism used for the compositional...

Please sign up or login with your details

Forgot password? Click here to reset