HORAE: an annotated dataset of books of hours

12/01/2020
by   Mélodie Boillet, et al.
0

We introduce in this paper a new dataset of annotated pages from books of hours, a type of handwritten prayer books owned and used by rich lay people in the late middle ages. The dataset was created for conducting historical research on the evolution of the religious mindset in Europe at this period since the book of hours represent one of the major sources of information thanks both to their rich illustrations and the different types of religious sources they contain. We first describe how the corpus was collected and manually annotated then present the evaluation of a state-of-the-art system for text line detection and for zone detection and typing. The corpus is freely available for research.

READ FULL TEXT

page 1

page 3

page 4

page 6

research
06/25/2021

Manually Annotated Spelling Error Corpus for Amharic

This paper presents a manually annotated spelling error corpus for Amhar...
research
07/29/2023

ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus

We introduce the ÌròyìnSpeech corpus – a new dataset influenced by a des...
research
05/19/2023

DMDD: A Large-Scale Dataset for Dataset Mentions Detection

The recognition of dataset names is a critical task for automatic inform...
research
04/01/2020

Evolution and Transformation of Scientific Knowledge over the Sphaera Corpus: A Network Study

We investigated the evolution and transformation of scientific knowledge...
research
04/26/2023

SIMARA: a database for key-value information extraction from full pages

We propose a new database for information extraction from historical han...
research
02/26/2018

Publishing a Quality Context-aware Annotated Corpus and Lexicon for Harassment Research

Having a quality annotated corpus is essential especially for applied re...
research
11/09/2020

What time is it? Temporal Analysis of Novels

Recognizing the flow of time in a story is a crucial aspect of understan...

Please sign up or login with your details

Forgot password? Click here to reset