Overlapped speech and gender detection with WavLM pre-trained features

09/09/2022
by   Martin Lebourdais, et al.

This article focuses on overlapped speech and gender detection in order to study interactions between women and men in French audiovisual media (Gender Equality Monitoring project). In this application context, we need to automatically segment the speech signal according to the speakers' gender and to identify when at least two speakers speak at the same time. We propose to use the WavLM model, which has the advantage of being pre-trained on a huge amount of speech data, to build an overlapped speech detection (OSD) system and a gender detection (GD) system. In this study, we use two different corpora: the DIHARD III corpus, which is well adapted for the OSD task but lacks gender information, and the ALLIES corpus, which fits the project application context. Our best OSD system is a Temporal Convolutional Network (TCN) with WavLM pre-trained features as input, which reaches a new state-of-the-art F1-score performance on DIHARD. A neural GD system is trained with WavLM inputs on a gender-balanced subset of the French broadcast news ALLIES data and obtains an accuracy of 97.9%. This work opens new perspectives for human science researchers regarding the differences of representation between women and men in French media.
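To make the OSD target and its F1-score concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes speaker turns are available as (speaker, start, end) tuples in seconds, marks a frame as overlapped when at least two distinct speakers are active in it, and scores a hypothesis against a reference at frame level. The function names `overlap_labels` and `f1_score`, the tuple layout, and the 20 ms default frame step are illustrative assumptions.

```python
from typing import List, Tuple

# Hypothetical annotation format: (speaker_id, start_seconds, end_seconds).
Segment = Tuple[str, float, float]


def overlap_labels(segments: List[Segment], duration: float,
                   frame_step: float = 0.02) -> List[int]:
    """Return one label per frame: 1 if at least two speakers are active, else 0."""
    n_frames = int(round(duration / frame_step))
    active = [set() for _ in range(n_frames)]
    for speaker, start, end in segments:
        first = max(0, int(round(start / frame_step)))
        last = min(n_frames, int(round(end / frame_step)))
        for i in range(first, last):
            # A set ensures a speaker is counted once per frame.
            active[i].add(speaker)
    return [1 if len(speakers) >= 2 else 0 for speakers in active]


def f1_score(reference: List[int], hypothesis: List[int]) -> float:
    """Frame-level F1-score of the overlap class (label 1)."""
    pairs = list(zip(reference, hypothesis))
    tp = sum(1 for r, h in pairs if r == 1 and h == 1)
    fp = sum(1 for r, h in pairs if r == 0 and h == 1)
    fn = sum(1 for r, h in pairs if r == 1 and h == 0)
    if tp == 0:
        # Convention chosen here: no true positive gives a score of 0.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    turns = [("A", 0.0, 1.0), ("B", 0.5, 1.5)]
    reference = overlap_labels(turns, duration=2.0, frame_step=0.5)
    print(reference)
    print(f1_score(reference, [0, 1, 1, 0]))
```

In this toy example the two turns share the interval from 0.5 s to 1.0 s, so only the frame covering that interval is labelled as overlap. A real evaluation would use the frame rate of the feature extractor and the scoring protocol of the corpus.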


