Integrated Sequence Tagging for Medieval Latin Using Deep Representation Learning

03/04/2016
by   Mike Kestemont, et al.
0

In this paper we consider two sequence tagging tasks for medieval Latin: part-of-speech tagging and lemmatization. These are both basic, yet foundational preprocessing steps in applications such as text re-use detection. Nevertheless, they are generally complicated by the considerable orthographic variation which is typical of medieval Latin. In Digital Classics, these tasks are traditionally solved in a (i) cascaded and (ii) lexicon-dependent fashion. For example, a lexicon is used to generate all the potential lemma-tag pairs for a token, and next, a context-aware PoS-tagger is used to select the most appropriate tag-lemma pair. Apart from the problems with out-of-lexicon items, error percolation is a major downside of such approaches. In this paper we explore the possibility to elegantly solve these tasks using a single, integrated approach. For this, we make use of a layered neural network architecture from the field of deep representation learning.

READ FULL TEXT
research
03/31/2021

Joint Khmer Word Segmentation and Part-of-Speech Tagging Using Deep Learning

Khmer text is written from left to right with optional space. Space is n...
research
11/14/2017

From Word Segmentation to POS Tagging for Vietnamese

This paper presents an empirical comparison of two strategies for Vietna...
research
05/28/2021

Cisco at SemEval-2021 Task 5: What's Toxic?: Leveraging Transformers for Multiple Toxic Span Extraction from Online Comments

Social network platforms are generally used to share positive, construct...
research
03/06/2018

The Impact of Semantic Context Cues on the User Acceptance of Tag Recommendations: An Online Study

In this paper, we present the results of an online study with the aim to...
research
03/18/2017

Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks

Recent papers have shown that neural networks obtain state-of-the-art pe...
research
04/29/2018

Sequence Tagging with Policy-Value Networks and Tree Search

In this paper we propose a novel reinforcement learning based model for ...
research
11/29/2018

Multi-Scale Distributed Representation for Deep Learning and its Application to b-Jet Tagging

Recently machine learning algorithms based on deep layered artificial ne...

Please sign up or login with your details

Forgot password? Click here to reset