Toxicity Detection: Does Context Really Matter?

06/01/2020
by   John Pavlopoulos, et al.
0

Moderation is crucial to promoting healthy on-line discussions. Although several `toxicity' detection datasets and models have been published, most of them ignore the context of the posts, implicitly assuming that comments maybe judged independently. We investigate this assumption by focusing on two questions: (a) does context affect the human judgement, and (b) does conditioning on context improve performance of toxicity detection systems? We experiment with Wikipedia conversations, limiting the notion of context to the previous post in the thread and the discussion title. We find that context can both amplify or mitigate the perceived toxicity of posts. Moreover, a small but significant subset of manually labeled posts (5 up having the opposite toxicity labels if the annotators are not provided with context. Surprisingly, we also find no evidence that context actually improves the performance of toxicity classifiers, having tried a range of classifiers and mechanisms to make them context aware. This points to the need for larger datasets of comments annotated in context. We make our code and data publicly available.

READ FULL TEXT
research
01/26/2022

Explainable Patterns for Distinction and Prediction of Moral Judgement on Reddit

The forum r/AmITheAsshole in Reddit hosts discussion on moral issues bas...
research
11/19/2021

Toxicity Detection can be Sensitive to the Conversational Context

User posts whose perceived toxicity depends on the conversational contex...
research
04/16/2023

A Study of Update Request Comments in Stack Overflow Answer Posts

Comments play an important role in updating Stack Overflow (SO) posts. T...
research
11/10/2020

Does Social Support Expressed in Post Titles Elicit Comments in Online Substance Use Recovery Forums?

Individuals recovering from substance use often seek social support (emo...
research
04/15/2019

Something's Brewing! Early Prediction of Controversy-causing Posts from Discussion Features

Controversial posts are those that split the preferences of a community,...
research
06/16/2022

Enriching Abusive Language Detection with Community Context

Uses of pejorative expressions can be benign or actively empowering. Whe...
research
10/25/2019

Exploring Author Context for Detecting Intended vs Perceived Sarcasm

We investigate the impact of using author context on textual sarcasm det...

Please sign up or login with your details

Forgot password? Click here to reset