A Step Towards Interpretable Authorship Verification

06/22/2020
by   Oren Halvani, et al.
0

A central problem that has been researched for many years in the field of digital text forensics is the question whether two documents were written by the same author. Authorship verification (AV) is a research branch in this field that deals with this question. Over the years, research activities in the context of AV have steadily increased, which has led to a variety of approaches trying to solve this problem. Many of these approaches, however, make use of features that are related to or influenced by the topic of the documents. Therefore, it may accidentally happen that their verification results are based not on the writing style (the actual focus of AV), but on the topic of the documents. To address this problem, we propose an alternative AV approach that considers only topic-agnostic features in its classification decision. In addition, we present a post-hoc interpretation method that allows to understand which particular features have contributed to the prediction of the proposed AV method. To evaluate the performance of our AV method, we compared it with ten competing baselines (including the current state of the art) on four challenging data sets. The results show that our approach outperforms all baselines in two cases (with a maximum accuracy of 84 cases it performs as well as the strongest baseline.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2020

An Improved Topic Masking Technique for Authorship Analysis

Authorship verification (AV) is an important sub-area of digital text fo...
research
06/24/2019

Assessing the Applicability of Authorship Verification Methods

Authorship verification (AV) is a research subject in the field of digit...
research
08/20/2019

Similarity Learning for Authorship Verification in Social Media

Authorship verification tries to answer the question if two documents wi...
research
03/17/2018

Experiments with Neural Networks for Small and Large Scale Authorship Verification

We propose two models for a special case of authorship verification prob...
research
12/31/2018

Unary and Binary Classification Approaches and their Implications for Authorship Verification

Retrieving indexed documents, not by their topical content but their wri...
research
09/03/2021

LG4AV: Combining Language Models and Graph Neural Networks for Author Verification

The automatic verification of document authorships is important in vario...

Please sign up or login with your details

Forgot password? Click here to reset