Datasets and Models for Authorship Attribution on Italian Personal Writings

11/16/2020
by   Gaetana Ruggiero, et al.
0

Existing research on Authorship Attribution (AA) focuses on texts for which a lot of data is available (e.g novels), mainly in English. We approach AA via Authorship Verification on short Italian texts in two novel datasets, and analyze the interaction between genre, topic, gender and length. Results show that AV is feasible even with little data, but more evidence helps. Gender and topic can be indicative clues, and if not controlled for, they might overtake more specific aspects of personal style.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/12/2022

The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary Texts

We present the Project Dialogism Novel Corpus, or PDNC, an annotated dat...
research
12/02/2020

Analyzing Stylistic Variation across Different Political Regimes

In this article we propose a stylistic analysis of texts written across ...
research
12/26/2018

An Investigation of Supervised Learning Methods for Authorship Attribution in Short Hinglish Texts using Char & Word N-grams

The writing style of a person can be affirmed as a unique identity indic...
research
01/30/2020

Authorship Attribution of Source Code: A Language-Agnostic Approach and Applicability in Software Engineering

Authorship attribution of source code has been an established research t...
research
12/12/2016

Unraveling reported dreams with text analytics

We investigate what distinguishes reported dreams from other personal na...
research
08/19/2017

Measuring the Effect of Discourse Relations on Blog Summarization

The work presented in this paper attempts to evaluate and quantify the u...
research
03/24/2020

Forensic Authorship Analysis of Microblogging Texts Using N-Grams and Stylometric Features

In recent years, messages and text posted on the Internet are used in cr...

Please sign up or login with your details

Forgot password? Click here to reset