Accessibility and Trajectory-Based Text Characterization

01/17/2022
by Bárbara C. e Souza, et al.

Several complex systems present intricate characteristics extending along many scales. Characterizations of such systems are used in various applications, including text classification, the understanding of diseases, and comparison between cities. In particular, texts are also characterized by a hierarchical structure that can be approached by using multi-scale concepts and methods. The present work aims at developing these possibilities while focusing on mesoscopic representations of networks. More specifically, we adopt an extension to the mesoscopic approach to represent text narratives, in which only the recurrent relationships among tagged parts of speech are considered to establish connections among sequential pieces of text (e.g., paragraphs). The texts were then characterized by scale-dependent complementary methods: accessibility, symmetry, and recurrence signatures. In order to evaluate the potential of these concepts and methods, we approached the problem of distinguishing between literary genres (fiction and non-fiction). A set of 300 books organized into the two genres was considered, and the books were compared by using the aforementioned approaches. All the methods were capable of differentiating to some extent between the two genres. The accessibility and symmetry reflected the narrative asymmetries, while the recurrence signature provides a more direct indication of the non-sequential semantic connections taking place along the narrative.
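
To make the accessibility measurement more concrete, the following is a minimal sketch and not the authors' implementation. It assumes the common definition of accessibility as the exponential of the Shannon entropy of the probabilities of reaching each node after h steps, and it assumes an ordinary random walk on a small weighted adjacency matrix; the paper may use a different walk dynamics and a different network construction. The function names (`transition_matrix`, `mat_mul`, `accessibility`) are hypothetical.

```python
import math


def transition_matrix(adj):
    """Row-normalize an adjacency matrix (list of lists) into
    one-step random-walk probabilities. Rows with no edges stay zero."""
    rows = []
    for row in adj:
        total = sum(row)
        rows.append([w / total if total else 0.0 for w in row])
    return rows


def mat_mul(a, b):
    """Multiply two square matrices given as lists of lists."""
    n = len(a)
    return [
        [sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


def accessibility(adj, node, h):
    """Accessibility of `node` at scale h: exp of the Shannon entropy of
    the h-step random-walk distribution starting at `node`.
    Assumption: plain random walk, not necessarily the paper's dynamics."""
    if h < 1:
        raise ValueError("h must be at least 1")
    p = transition_matrix(adj)
    ph = p
    for _ in range(h - 1):
        ph = mat_mul(ph, p)  # ph now holds the (step + 2)-step probabilities
    probs = [x for x in ph[node] if x > 0]
    entropy = -sum(x * math.log(x) for x in probs)
    return math.exp(entropy)


# Toy example: three sequential pieces of text linked in a chain 0 - 1 - 2.
chain = [
    [0, 1, 0],
    [1, 0, 1],
    [0, 1, 0],
]
middle = accessibility(chain, 1, 1)
end = accessibility(chain, 0, 1)
```

In this toy chain, the middle node reaches two neighbors with equal probability in one step, so its accessibility at h = 1 is 2 (up to floating-point rounding), while an end node can reach only one neighbor and has accessibility 1. Under this definition the value can be read as the effective number of nodes reachable at a given scale.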


research
06/30/2016

Representation of texts as complex networks: a mesoscopic approach

Statistical techniques that analyze texts, referred to as text analytics...
research
04/17/2019

Text Classification Algorithms: A Survey

In recent years, there has been an exponential growth in the number of c...
research
06/22/2018

Paragraph-based complex networks: application to document classification and authenticity verification

With the increasing number of texts made available on the Internet, many...
research
03/12/2016

Sequential Short-Text Classification with Recurrent and Convolutional Neural Networks

Recent approaches based on artificial neural networks (ANNs) have shown ...
research
04/09/2015

Concentric network symmetry grasps authors' styles in word adjacency networks

Several characteristics of written texts have been inferred from statist...
research
02/24/2022

The effect of fatigue on the performance of online writer recognition

Background: The performance of biometric modalities based on things done...
