On the "Calligraphy" of Books

05/29/2017
by   Vanessa Q. Marinho, et al.
0

Authorship attribution is a natural language processing task that has been widely studied, often by considering small order statistics. In this paper, we explore a complex network approach to assign the authorship of texts based on their mesoscopic representation, in an attempt to capture the flow of the narrative. Indeed, as reported in this work, such an approach allowed the identification of the dominant narrative structure of the studied authors. This has been achieved due to the ability of the mesoscopic approach to take into account relationships between different, not necessarily adjacent, parts of the text, which is able to capture the story flow. The potential of the proposed approach has been illustrated through principal component analysis, a comparison with the chance baseline method, and network visualization. Such visualizations reveal individual characteristics of the authors, which can be understood as a kind of calligraphy.

READ FULL TEXT

page 6

page 7

research
08/16/2018

Linguistic data mining with complex networks: a stylometric-oriented approach

By representing a text by a set of words and their co-occurrences, one o...
research
07/23/2016

Authorship attribution via network motifs identification

Concepts and methods of complex networks can be used to analyse texts at...
research
05/01/2017

Labelled network subgraphs reveal stylistic subtleties in written texts

The vast amount of data and increase of computational capacity have allo...
research
09/29/2021

Reflexivity in Issues of Scale and Representation in a Digital Humanities Project

In this paper, we explore issues that we have encountered in developing ...
research
07/16/2021

An Automated Approach to Reasoning About Task-Oriented Insights in Responsive Visualization

Authors often transform a large screen visualization for smaller display...
research
05/11/2017

On the role of words in the network structure of texts: application to authorship attribution

Well-established automatic analyses of texts mainly consider frequencies...

Please sign up or login with your details

Forgot password? Click here to reset