Possibilities, Challenges and Limits of a European Charters Corpus (Cartae Europae Medii Aevi - CEMA)

04/21/2021
by   Nicolas Perreaux, et al.
0

The objective of this paper is to present a meta-corpus of diplomatic documents entitled Cartae Europae Medii Aevi or CEMA. It shows the logic and limits of this meta-corpus, which contains 250,000 documents, by specifying both its structure and its possible future extensions. The second part of the paper is devoted to specific examples that will attempt to show the interest of such a database. The third part examine the possibilities opened up by the corpus in terms of historical semantics.

READ FULL TEXT
research
08/03/2020

Elsevier OA CC-By Corpus

We introduce the Elsevier OA CC-BY corpus. This is the first open corpus...
research
07/27/2018

Ethnographie de la structuration d'un corpus collectif de messages de soutien social en ligne

In this paper, we propose a study of progressive development of the stru...
research
02/02/2017

Topic Modeling the Hàn diăn Ancient Classics

Ancient Chinese texts present an area of enormous challenge and opportun...
research
07/20/2019

Towards meta-interpretive learning of programming language semantics

We introduce a new application for inductive logic programming: learning...
research
04/06/2020

An Annotated Corpus of Emerging Anglicisms in Spanish Newspaper Headlines

The extraction of anglicisms (lexical borrowings from English) is releva...
research
12/15/2022

The Effects of Character-Level Data Augmentation on Style-Based Dating of Historical Manuscripts

Identifying the production dates of historical manuscripts is one of the...
research
12/28/2017

Corpus specificity in LSA and Word2vec: the role of out-of-domain documents

Latent Semantic Analysis (LSA) and Word2vec are some of the most widely ...

Please sign up or login with your details

Forgot password? Click here to reset