Speech Toxicity Analysis: A New Spoken Language Processing Task

10/14/2021
by Sreyan Ghosh, et al.

Toxic speech, also known as hate speech, is regarded as one of the crucial issues plaguing online social media today. Most recent work on toxic speech detection is confined to the text modality, and no existing work addresses toxicity detection from spoken utterances. In this paper, we propose a new Spoken Language Processing task: detecting toxicity in speech. We introduce DeToxy, the first publicly available toxicity-annotated dataset for English speech, sourced from various openly available speech databases and consisting of over 2 million utterances. Finally, we analyze how a speech corpus annotated for toxicity can facilitate the development of end-to-end (E2E) models that better capture prosodic cues in speech, thereby improving toxicity classification of spoken utterances.
