Conversational Networks for Automatic Online Moderation

01/31/2019
by   Etienne Papegnies, et al.
0

Moderation of user-generated content in an online community is a challenge that has great socio-economical ramifications. However, the costs incurred by delegating this work to human agents are high. For this reason, an automatic system able to detect abuse in user-generated content is of great interest. There are a number of ways to tackle this problem, but the most commonly seen in practice are word filtering or regular expression matching. The main limitations are their vulnerability to intentional obfuscation on the part of the users, and their context-insensitive nature. Moreover, they are language-dependent and may require appropriate corpora for training. In this paper, we propose a system for automatic abuse detection that completely disregards message content. We first extract a conversational network from raw chat logs and characterize it through topological measures. We then use these as features to train a classifier on our abuse detection task. We thoroughly assess our system on a dataset of user comments originating from a French Massively Multiplayer Online Game. We identify the most appropriate network extraction parameters and discuss the discriminative power of our features, relatively to their topological and temporal nature. Our method reaches an F-measure of 83.89 when using the full feature set, improving on existing approaches. With a selection of the most discriminative features, we dramatically cut computing time while retaining most of the performance (82.65).

READ FULL TEXT

page 1

page 18

research
05/20/2019

Abusive Language Detection in Online Conversations by Combining Content-and Graph-based Features

In recent years, online social networks have allowed worldwide users to ...
research
12/29/2020

Can You be More Social? Injecting Politeness and Positivity into Task-Oriented Conversational Agents

Goal-oriented conversational agents are becoming prevalent in our daily ...
research
06/13/2022

Hate Speech and Counter Speech Detection: Conversational Context Does Matter

Hate speech is plaguing the cyberspace along with user-generated content...
research
04/19/2022

Understanding Toxicity Triggers on Reddit in the Context of Singapore

While the contagious nature of online toxicity sparked increasing intere...
research
03/13/2020

WAC: A Corpus of Wikipedia Conversations for Online Abuse Detection

With the spread of online social networks, it is more and more difficult...
research
11/16/2020

Conversational agents for learning foreign languages – a survey

Conversational practice, while crucial for all language learners, can be...
research
09/30/2019

ATOL: Measure Vectorisation for Automatic Topologically-Oriented Learning

Robust topological information commonly comes in the form of a set of pe...

Please sign up or login with your details

Forgot password? Click here to reset