A complex network approach to stylometry

06/30/2015
by   Diego R. Amancio, et al.

Statistical methods have been widely employed to study the fundamental properties of language. In recent years, methods from complex and dynamical systems have proved useful for building several language models. Despite the large number of studies devoted to representing texts with physical models, only a few have shown how the properties of the underlying physical systems can be used to improve the performance of natural language processing tasks. In this paper, I address this problem by devising complex network methods that improve the performance of current statistical methods. Using a fuzzy classification strategy, I show that the topological properties extracted from texts complement the traditional textual description. In several cases, hybrid approaches outperformed those relying only on traditional or only on networked methods. Because the proposed model is generic, the framework devised here could be straightforwardly applied to similar textual applications in which topology plays a pivotal role in describing the interacting agents.
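To make the idea of topological features and a hybrid description concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the method of the paper: the abstract does not specify the network model, so the sketch assumes a word adjacency (co-occurrence) network, and the names `cooccurrence_network`, `degree`, `clustering`, `hybrid_membership`, and the `weight` parameter are all hypothetical. The fuzzy combination is reduced here to a simple convex mix of per-class membership scores.

```python
from collections import defaultdict


def cooccurrence_network(tokens):
    """Assumed model: link every pair of adjacent, distinct words (undirected)."""
    graph = defaultdict(set)
    for left, right in zip(tokens, tokens[1:]):
        if left != right:
            graph[left].add(right)
            graph[right].add(left)
    return dict(graph)


def degree(graph, node):
    """Number of distinct neighbours of a word."""
    return len(graph[node])


def clustering(graph, node):
    """Fraction of neighbour pairs of `node` that are themselves linked."""
    neighbors = list(graph[node])
    k = len(neighbors)
    if k < 2:
        return 0.0
    links = sum(
        1
        for i in range(k)
        for j in range(i + 1, k)
        if neighbors[j] in graph[neighbors[i]]
    )
    return 2.0 * links / (k * (k - 1))


def hybrid_membership(traditional, networked, weight=0.5):
    """Hypothetical hybrid: convex combination of two per-class membership scores.

    `traditional` and `networked` map each class label to a membership in [0, 1];
    `weight` is the share given to the traditional description.
    """
    return {
        label: weight * traditional[label] + (1.0 - weight) * networked[label]
        for label in traditional
    }


if __name__ == "__main__":
    tokens = "the cat saw the dog and the cat ran".split()
    graph = cooccurrence_network(tokens)
    print(degree(graph, "the"), clustering(graph, "the"))
    print(hybrid_membership({"A": 0.8, "B": 0.2}, {"A": 0.4, "B": 0.6}))
```

In a sketch like this, `weight=1.0` recovers the purely traditional description and `weight=0.0` the purely networked one, which is one simple way to compare a hybrid against either method on its own.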

