Assessing the Applicability of Authorship Verification Methods

06/24/2019
by   Oren Halvani, et al.
0

Authorship verification (AV) is a research subject in the field of digital text forensics that concerns itself with the question, whether two documents have been written by the same person. During the past two decades, an increasing number of proposed AV approaches can be observed. However, a closer look at the respective studies reveals that the underlying characteristics of these methods are rarely addressed, which raises doubts regarding their applicability in real forensic settings. The objective of this paper is to fill this gap by proposing clear criteria and properties that aim to improve the characterization of existing and future AV approaches. Based on these properties, we conduct three experiments using 12 existing AV approaches, including the current state of the art. The examined methods were trained, optimized and evaluated on three self-compiled corpora, where each corpus focuses on a different aspect of applicability. Our results indicate that part of the methods are able to cope with very challenging verification cases such as 250 characters long informal chat conversations (72.7 which two scientific documents were written at different times with an average difference of 15.6 years (> 75 involved methods are prone to cross-topic verification cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/31/2018

Unary and Binary Classification Approaches and their Implications for Authorship Verification

Retrieving indexed documents, not by their topical content but their wri...
research
05/02/2020

An Improved Topic Masking Technique for Authorship Analysis

Authorship verification (AV) is an important sub-area of digital text fo...
research
06/22/2020

A Step Towards Interpretable Authorship Verification

A central problem that has been researched for many years in the field o...
research
08/20/2019

Similarity Learning for Authorship Verification in Social Media

Authorship verification tries to answer the question if two documents wi...
research
07/24/2023

Towards Generalising Neural Topical Representations

Topic models have evolved from conventional Bayesian probabilistic model...
research
06/25/2021

On Preserving the Behavior in Software Refactoring: A Systematic Mapping Study

Context: Refactoring is the art of modifying the design of a system with...
research
11/16/2021

RemoteVote and SAFE Vote: Towards Usable End-to-End Verification for Vote-by-Mail

Postal voting is growing rapidly in the U.S., with 43 ballots by mail in...

Please sign up or login with your details

Forgot password? Click here to reset