Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free

07/25/2022
by   M. Iftekhar Tanveer, et al.
2

Podcasts are conversational in nature and speaker changes are frequent – requiring speaker diarization for content understanding. We propose an unsupervised technique for speaker diarization without relying on language-specific components. The algorithm is overlap-aware and does not require information about the number of speakers. Our approach shows 79 improvement on purity scores (34 solution on podcast data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/08/2023

TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization

Recently, end-to-end neural diarization (EEND) is introduced and achieve...
research
09/02/2019

Identifying Personality Traits Using Overlap Dynamics in Multiparty Dialogue

Research on human spoken language has shown that speech plays an importa...
research
08/04/2023

Speaker Diarization of Scripted Audiovisual Content

The media localization industry usually requires a verbatim script of th...
research
10/25/2019

Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection

We address the problem of effectively handling overlapping speech in a d...
research
05/09/2019

Adversarially Trained Autoencoders for Parallel-Data-Free Voice Conversion

We present a method for converting the voices between a set of speakers....
research
08/29/2017

Comparing Human and Machine Errors in Conversational Speech Transcription

Recent work in automatic recognition of conversational telephone speech ...
research
03/06/2020

Lightweight Speaker Verification for Online Identification of New Speakers with Short Segments

Verifying if two audio segments belong to the same speaker has been rece...

Please sign up or login with your details

Forgot password? Click here to reset