Corpus and Models for Lemmatisation and POS-tagging of Old French

09/23/2021
by   Jean-Baptiste Camps, et al.
25

Old French is a typical example of an under-resourced historic languages, that furtherly displays animportant amount of linguistic variation. In this paper, we present the current results of a long going project (2015-...) and describe how we broached the difficult question of providing lemmatisation andPOS models for Old French with the help of neural taggers and the progressive constitution of dedicated corpora.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/09/2020

The VC-dimension of k-vertex d-polytopes

In this short note, we show that the VC-dimension of the class of k-vert...
research
10/28/2019

Reply to: Large-scale quantitative profiling of the Old English verse tradition

In Nature Human Behaviour 3/2019, an article was published entitled "Lar...
research
02/21/2022

Flexible Skylines: Customizing Skyline Queries Catching Desired Preferences

The techniques most extensively used to retrieve interesting data from d...
research
11/16/2020

A Probabilistic Approach in Historical Linguistics Word Order Change in Infinitival Clauses: from Latin to Old French

This research offers a new interdisciplinary approach to the field of Li...
research
01/30/2018

Manuscripts in Time and Space: Experiments in Scriptometrics on an Old French Corpus

Witnesses of medieval literary texts, preserved in manuscript, are layer...
research
04/26/2019

Producing Corpora of Medieval and Premodern Occitan

At a time when the quantity of - more or less freely - available data is...
research
12/22/2021

Investigating the 'old boy network' using latent space models

This paper investigates the nature of institutional ties between a group...

Please sign up or login with your details

Forgot password? Click here to reset