Methods for Detoxification of Texts for the Russian Language

05/19/2021
by Daryna Dementieva, et al.

We introduce the first study of automatic detoxification of Russian texts to combat offensive language. This kind of textual style transfer can be used, for instance, to process toxic content on social media. While much work has been done in this field for English, the task has not yet been addressed for Russian. We test two types of models, an unsupervised approach based on the BERT architecture that performs local corrections and a supervised approach based on the pretrained GPT-2 language model, and compare them with several baselines. In addition, we describe an evaluation setup that provides training datasets and metrics for automatic evaluation. The results show that the tested approaches can be successfully used for detoxification, although there is room for improvement.

