Defeating Author Gender Identification with Text Style Transfer

09/02/2020
by Reza Khan Mohammadi, et al.

Text style transfer is one of the most important tasks in Natural Language Processing, and many approaches have been tried for it. In this work, we introduce PGST, a novel polyglot text style transfer approach in the gender domain, composed of several building blocks. Provided each block is supplied with the required language-specific elements, our method can be applied to multiple languages. We use pre-trained word embeddings for token replacement, a character-based token classifier for gender exchange, and the beam search algorithm to extract the most fluent combination among all suggestions. Since our research introduces several approaches, we define a trade-off value for evaluating how well different models fool our gender identification model with transferred text. To demonstrate our method's multilingual applicability, we applied it to both English and Persian corpora and defeated our proposed gender identification model by 45.6 respectively, and obtained highly competitive evaluation results in a comparison with English state-of-the-art methods.
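The beam search step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each token position already has a list of replacement candidates (in PGST these would come from the word embeddings and the gender classifier), and it uses a hypothetical toy bigram table, `BIGRAM`, in place of a real fluency model. The names `beam_search` and `fluency` are also illustrative.

```python
from typing import Callable, List, Sequence


def beam_search(
    candidates: Sequence[Sequence[str]],
    score: Callable[[List[str]], float],
    beam_width: int = 3,
) -> List[str]:
    """Choose one token per position, keeping only the `beam_width`
    highest-scoring partial sentences after each position."""
    beams: List[List[str]] = [[]]
    for options in candidates:
        # Extend every surviving prefix with every candidate token.
        expanded = [prefix + [tok] for prefix in beams for tok in options]
        # Rank the extended prefixes and prune to the beam width.
        expanded.sort(key=score, reverse=True)
        beams = expanded[:beam_width]
    return beams[0]


# Hypothetical bigram scores standing in for a real fluency model.
BIGRAM = {
    ("my", "wife"): 2.0,
    ("wife", "is"): 1.0,
    ("is", "lovely"): 1.5,
    ("is", "handsome"): 0.5,
}


def fluency(tokens: List[str]) -> float:
    """Sum bigram scores; unseen bigrams are penalised with -1.0."""
    return sum(BIGRAM.get(pair, -1.0) for pair in zip(tokens, tokens[1:]))


# One candidate list per token position (assumed to be precomputed).
candidates = [["my"], ["wife", "spouse"], ["is"], ["lovely", "handsome"]]
best = beam_search(candidates, fluency, beam_width=2)
print(" ".join(best))
```

Pruning after every position keeps the search linear in sentence length instead of enumerating every combination of candidates, at the cost of possibly missing the globally best sentence when the beam is narrow.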

research
03/16/2022

Multilingual Pre-training with Language and Task Adaptation for Multilingual Text Style Transfer

We exploit the pre-trained seq2seq model mBART for multilingual text sty...
research
05/06/2020

Review of text style transfer based on deep learning

Text style transfer is a hot issue in recent natural language processing...
research
08/25/2019

Transforming Delete, Retrieve, Generate Approach for Controlled Text Style Transfer

Text style transfer is the task of transferring the style of text having...
research
10/14/2021

Few-shot Controllable Style Transfer for Low-Resource Settings: A Study in Indian Languages

Style transfer is the task of rewriting an input sentence into a target ...
research
06/20/2022

Studying the role of named entities for content preservation in text style transfer

Text style transfer techniques are gaining popularity in Natural Languag...
research
09/19/2021

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation

Documents as short as a single sentence may inadvertently reveal sensiti...
research
09/28/2021

How Different Text-preprocessing Techniques Using The BERT Model Affect The Gender Profiling of Authors

Forensic author profiling plays an important role in indicating possible...
