A Girl Has A Name: Detecting Authorship Obfuscation

05/02/2020
by   Asad Mahmood, et al.
0

Authorship attribution aims to identify the author of a text based on the stylometric analysis. Authorship obfuscation, on the other hand, aims to protect against authorship attribution by modifying a text's style. In this paper, we evaluate the stealthiness of state-of-the-art authorship obfuscation methods under an adversarial threat model. An obfuscator is stealthy to the extent an adversary finds it challenging to detect whether or not a text modified by the obfuscator is obfuscated - a decision that is key to the adversary interested in authorship attribution. We show that the existing authorship obfuscation methods are not stealthy as their obfuscated texts can be identified with an average F1 score of 0.87. The reason for the lack of stealthiness is that these obfuscators degrade text smoothness, as ascertained by neural language models, in a detectable manner. Our results highlight the need to develop stealthy authorship obfuscation methods that can better protect the identity of an author seeking anonymity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/15/2021

Avengers Ensemble! Improving Transferability of Authorship Obfuscation

Stylometric approaches have been shown to be quite effective for real-wo...
research
02/24/2016

Domain Specific Author Attribution Based on Feedforward Neural Network Language Models

Authorship attribution refers to the task of automatically determining t...
research
05/03/2014

Automated Attribution and Intertextual Analysis

In this work, we employ quantitative methods from the realm of statistic...
research
03/22/2022

A Girl Has A Name, And It's ... Adversarial Authorship Attribution for Deobfuscation

Recent advances in natural language processing have enabled powerful pri...
research
10/19/2022

Attribution and Obfuscation of Neural Text Authorship: A Data Mining Perspective

Two interlocking research questions of growing interest and importance i...
research
06/10/2021

DT-grams: Structured Dependency Grammar Stylometry for Cross-Language Authorship Attribution

Cross-language authorship attribution problems rely on either translatio...
research
01/15/2021

Identifying Authorship Style in Malicious Binaries: Techniques, Challenges Datasets

Attributing a piece of malware to its creator typically requires threat ...

Please sign up or login with your details

Forgot password? Click here to reset