Studying Socially Unacceptable Discourse Classification (SUD) through different eyes: "Are we on the same page ?"

08/08/2023
by Bruno Machado Carneiro, et al.

We study the characterization and detection of Socially Unacceptable Discourse (SUD) in online text. We first build and present a novel corpus containing a large variety of manually annotated texts from the different online sources used so far in state-of-the-art machine learning (ML) SUD detection solutions. This global context allows us to test the generalization ability of SUD classifiers that acquire knowledge around the same SUD categories but from different contexts. From this perspective, we analyze how (possibly) different annotation modalities influence SUD learning, and we discuss open challenges and research directions. We also provide several data insights that can support domain experts in the annotation task.


