Analysis of the Penn Korean Universal Dependency Treebank (PKT-UD): Manual Revision to Build Robust Parsing Model in Korean

05/26/2020
by Tae Hwan Oh, et al.

In this paper, we first raise important issues regarding the Penn Korean Universal Treebank (PKT-UD) and address them by manually revising the entire corpus, with the aim of producing cleaner UD annotations that are more faithful to Korean grammar. For compatibility with the rest of the UD corpora, we follow the UDv2 guidelines and extensively revise the part-of-speech tags and the dependency relations to reflect the morphological features and flexible word order of Korean. We evaluate the original and the revised versions of PKT-UD with transformer-based parsing models using biaffine attention. The parsing model trained on the revised corpus shows a significant improvement of 3.0 over the model trained on the previous corpus. Our error analysis demonstrates that the revision allows the parsing model to learn relations more robustly, reducing several critical errors made by the previous model.


