Avengers Ensemble! Improving Transferability of Authorship Obfuscation

09/15/2021
by Muhammad Haroon, et al.

Stylometric approaches have been shown to be quite effective for real-world authorship attribution. To mitigate the privacy threat posed by authorship attribution, researchers have proposed automated authorship obfuscation approaches that aim to conceal the stylometric artefacts that give away the identity of an anonymous document's author. Recent work has focused on authorship obfuscation approaches that rely on black-box access to an attribution classifier to evade attribution while preserving semantics. However, to be useful under a realistic threat model, it is important that these obfuscation approaches work well even when the adversary's attribution classifier is different from the one used internally by the obfuscator. Unfortunately, existing authorship obfuscation approaches do not transfer well to unseen attribution classifiers. In this paper, we propose an ensemble-based approach for transferable authorship obfuscation. Our experiments show that if an obfuscator can evade an ensemble attribution classifier, which is based on multiple base attribution classifiers, it is more likely to transfer to different attribution classifiers. Our analysis shows that ensemble-based authorship obfuscation achieves better transferability because it combines the knowledge from each of the base attribution classifiers by essentially averaging their decision boundaries.
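The idea of evading an ensemble rather than a single attribution classifier can be illustrated with a minimal sketch. It assumes soft voting (averaging the per-author probabilities of the base classifiers) as one plausible reading of "averaging their decision boundaries"; the abstract does not specify the exact combination rule. The two toy base classifiers, the author labels `alice` and `bob`, and all function names are hypothetical stand-ins, not the paper's models or code.

```python
from typing import Callable, Dict, Sequence

Probs = Dict[str, float]
Classifier = Callable[[str], Probs]


def average_probabilities(text: str, classifiers: Sequence[Classifier]) -> Probs:
    """Soft voting: average each author's probability over all base classifiers."""
    if not classifiers:
        raise ValueError("need at least one base classifier")
    totals: Dict[str, float] = {}
    for clf in classifiers:
        for author, p in clf(text).items():
            totals[author] = totals.get(author, 0.0) + p
    return {author: total / len(classifiers) for author, total in totals.items()}


def ensemble_predict(text: str, classifiers: Sequence[Classifier]) -> str:
    """Return the author with the highest averaged probability."""
    probs = average_probabilities(text, classifiers)
    return max(probs, key=probs.get)


def evades_ensemble(text: str, true_author: str,
                    classifiers: Sequence[Classifier]) -> bool:
    """An obfuscated text evades the ensemble if it is not attributed to its author."""
    return ensemble_predict(text, classifiers) != true_author


# Hypothetical toy base classifiers, each keyed to one stylometric cue.
def clf_word_length(text: str) -> Probs:
    words = text.split()
    avg_len = sum(len(w) for w in words) / max(len(words), 1)
    p_alice = 0.8 if avg_len > 5 else 0.3  # assumption: alice favors long words
    return {"alice": p_alice, "bob": 1 - p_alice}


def clf_commas(text: str) -> Probs:
    p_alice = 0.9 if text.count(",") >= 2 else 0.2  # assumption: alice uses many commas
    return {"alice": p_alice, "bob": 1 - p_alice}


BASE_CLASSIFIERS = [clf_word_length, clf_commas]
original = "Consequently, notwithstanding, everything collapsed"
obfuscated = "It all fell apart"
```

Here `evades_ensemble(obfuscated, "alice", BASE_CLASSIFIERS)` checks the obfuscated text against the averaged vote of both cues rather than against either cue alone. An obfuscator that uses this check as its internal objective has to move the text away from every base classifier at once, which is the intuition the abstract gives for better transfer to unseen classifiers.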

research
03/22/2022

A Girl Has A Name, And It's ... Adversarial Authorship Attribution for Deobfuscation

Recent advances in natural language processing have enabled powerful pri...
research
05/02/2020

A Girl Has A Name: Detecting Authorship Obfuscation

Authorship attribution aims to identify the author of a text based on th...
research
11/08/2017

Picasso, Matisse, or a Fake? Automated Analysis of Drawings at the Stroke Level for Attribution and Authentication

This paper proposes a computational approach for analysis of strokes in ...
research
02/18/2020

Camera Model Anonymisation with Augmented cGANs

The model of camera that was used to capture a particular photographic i...
research
12/09/2020

On an Unknown Ancestor of Burrows' Delta Measure

This article points out some surprising similarities between a 1944 stud...
research
09/30/2014

An agent-driven semantical identifier using radial basis neural networks and reinforcement learning

Due to the huge availability of documents in digital form, and the decep...
research
11/26/2019

CAWA: An Attention-Network for Credit Attribution

Credit attribution is the task of associating individual parts in a docu...
