ADIMA: Abuse Detection In Multilingual Audio

02/16/2022
by   Vikram Gupta, et al.
0

Abusive content detection in spoken text can be addressed by performing Automatic Speech Recognition (ASR) and leveraging advancements in natural language processing. However, ASR models introduce latency and often perform sub-optimally for profane words as they are underrepresented in training corpora and not spoken clearly or completely. Exploration of this problem entirely in the audio domain has largely been limited by the lack of audio datasets. Building on these challenges, we propose ADIMA, a novel, linguistically diverse, ethically sourced, expert annotated and well-balanced multilingual profanity detection audio dataset comprising of 11,775 audio samples in 10 Indic languages spanning 65 hours and spoken by 6,446 unique users. Through quantitative experiments across monolingual and cross-lingual zero-shot settings, we take the first step in democratizing audio based content moderation in Indic languages and set forth our dataset to pave future work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/12/2020

Cross-lingual and Multilingual Spoken Term Detection for Low-Resource Indian Languages

Spoken Term Detection (STD) is the task of searching for words or phrase...
research
10/07/2021

Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0

We propose a simple and effective cross-lingual transfer learning method...
research
03/22/2023

AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages

The advancement of speech technologies has been remarkable, yet its inte...
research
07/12/2020

Fine-grained Language Identification with Multilingual CapsNet Model

Due to a drastic improvement in the quality of internet services worldwi...
research
05/03/2023

Plug-and-Play Multilingual Few-shot Spoken Words Recognition

As technology advances and digital devices become prevalent, seamless hu...
research
04/08/2020

The Spotify Podcasts Dataset

Podcasts are a relatively new form of audio media. Episodes appear on a ...
research
04/15/2018

Transcribing Lyrics From Commercial Song Audio: The First Step Towards Singing Content Processing

Spoken content processing (such as retrieval and browsing) is maturing, ...

Please sign up or login with your details

Forgot password? Click here to reset