Unsupervised authorship attribution

03/26/2015
by   David Fifield, et al.
0

We describe a technique for attributing parts of a written text to a set of unknown authors. Nothing is assumed to be known a priori about the writing styles of potential authors. We use multiple independent clusterings of an input text to identify parts that are similar and dissimilar to one another. We describe algorithms necessary to combine the multiple clusterings into a meaningful output. We show results of the application of the technique on texts having multiple writing styles.

READ FULL TEXT

page 9

page 10

page 11

research
05/01/2023

A Computational Model For Individual Scholars' Writing Style Dynamics

A manuscript's writing style is central in determining its readership, i...
research
08/16/2018

Linguistic data mining with complex networks: a stylometric-oriented approach

By representing a text by a set of words and their co-occurrences, one o...
research
12/19/2014

Multiple Authors Detection: A Quantitative Analysis of Dream of the Red Chamber

Inspired by the authorship controversy of Dream of the Red Chamber and t...
research
05/17/2019

D2d -- XML for Authors

D2d is an input format which allows experienced authors to create type c...
research
03/03/2023

Who could be behind QAnon? Authorship attribution with supervised machine-learning

A series of social media posts signed under the pseudonym "Q", started a...
research
11/09/2022

A comparison of several AI techniques for authorship attribution on Romanian texts

Determining the author of a text is a difficult task. Here we compare mu...
research
08/15/2022

Reproduction and Replication of an Adversarial Stylometry Experiment

Maintaining anonymity while communicating using natural language remains...

Please sign up or login with your details

Forgot password? Click here to reset