Latin writing styles analysis with Machine Learning: New approach to old questions

09/01/2021
by   Arianna Di Bernardo, et al.

In the Middle Ages, texts were learned by heart and passed on orally from generation to generation. Adapting the art of prose and poetry preserved particular descriptions and compositions characteristic of many literary genres. Given this specific construction of literature composed in Latin, we can search for and indicate, in terms of probability, patterns that point to the familiar sources of specific narrative texts. Natural Language Processing tools allowed us to transform textual objects into numerical ones and then to apply machine learning algorithms to extract information from the dataset. We put these concepts and observations into practice to create a tool for analyzing narrative texts based on open-source databases. The tool focuses on search resources that enable detailed searching throughout a text. The main objectives of the study are to find similarities between sentences and between documents. We then applied machine learning algorithms to selected texts to estimate specific features (for instance, authorship or century) and to recognize the sources of anonymous texts with a certain percentage of confidence.
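The abstract describes turning text into numerical objects and then measuring similarity between sentences, but it does not name the representation used. As a minimal sketch, assuming a TF-IDF bag-of-words representation and cosine similarity (both are assumptions made for illustration, not the authors' confirmed method), the idea could look like this in Python. The function names and the three sample sentences are hypothetical.

```python
import math
from collections import Counter

def tokenize(text):
    # Naive whitespace tokenizer; a real pipeline would likely lemmatize Latin.
    return [w for w in text.lower().split() if w.isalpha()]

def tfidf_vectors(docs):
    # Build sparse TF-IDF vectors (dict: term -> weight) for each document.
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))  # document frequency counts each term once per doc
    # Smoothed inverse document frequency (an assumed, common variant).
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        vectors.append({t: tf[t] * idf[t] for t in tf})
    return vectors

def cosine(u, v):
    # Cosine similarity between two sparse vectors.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

# Illustrative sentences only; they are not taken from the study's dataset.
sentences = [
    "gallia est omnis divisa in partes tres",
    "arma virumque cano troiae qui primus ab oris",
    "gallia est omnis divisa in partes quattuor",
]

vecs = tfidf_vectors(sentences)
sim_related = cosine(vecs[0], vecs[2])    # sentences sharing most words
sim_unrelated = cosine(vecs[0], vecs[1])  # sentences sharing no words
print(sim_related > sim_unrelated)
```

The same vectors could in principle be passed to a classifier to predict features such as authorship or century, which is the second step the abstract mentions.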

research
02/04/2015

Authorship recognition via fluctuation analysis of network topology and word intermittency

Statistical methods have been widely employed in many practical natural ...
research
10/22/2020

Method of noun phrase detection in Ukrainian texts

Introduction. The area of natural language processing considers AI-compl...
research
08/11/2022

Searching for chromate replacements using natural language processing and machine learning algorithms

The past few years has seen the application of machine learning utilised...
research
02/28/2020

Fast Indexes for Gapped Pattern Matching

We describe indexes for searching large data sets for variable-length-ga...
research
01/14/2021

Estimation of the Frequency of Occurrence of Italian Phonemes in Text

The purpose of this project was to derive a reliable estimate of the fre...
research
08/15/2022

SynKB: Semantic Search for Synthetic Procedures

In this paper we present SynKB, an open-source, automatically extracted ...
