Domain-Dependent Speaker Diarization for the Third DIHARD Challenge

01/25/2021
by   A Kishore Kumar, et al.
0

This report presents the system developed by the ABSP Laboratory team for the third DIHARD speech diarization challenge. Our main contribution in this work is to develop a simple and efficient solution for acoustic domain dependent speech diarization. We explore speaker embeddings for acoustic domain identification (ADI) task. Our study reveals that i-vector based method achieves considerably better performance than x-vector based approach in the third DIHARD challenge dataset. Next, we integrate the ADI module with the diarization framework. The performance substantially improved over that of the baseline when we optimized the thresholds for agglomerative hierarchical clustering and the parameters for dimensionality reduction during scoring for individual acoustic domains. We achieved a relative improvement of 9.63% and 10.64% in DER for core and full conditions, respectively, for Track 1 of the DIHARD III evaluation set.

READ FULL TEXT

page 1

page 2

research
02/10/2021

ABSP System for The Third DIHARD Challenge

This report describes the speaker diarization system developed by the AB...
research
08/05/2022

Robust Acoustic Domain Identification with its Application to Speaker Diarization

With the rise in multimedia content over the years, more variety is obse...
research
11/07/2018

Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification

This article presents a novel approach for learning domain-invariant spe...
research
04/08/2019

Improved Speaker-Dependent Separation for CHiME-5 Challenge

This paper summarizes several follow-up contributions for improving our ...
research
10/07/2021

Disentangled dimensionality reduction for noise-robust speaker diarisation

The objective of this work is to train noise-robust speaker embeddings f...
research
10/29/2021

VRAIN-UPV MLLP's system for the Blizzard Challenge 2021

This paper presents the VRAIN-UPV MLLP's speech synthesis system for the...
research
03/19/2021

USTC-NELSLIP System Description for DIHARD-III Challenge

This system description describes our submission system to the Third DIH...

Please sign up or login with your details

Forgot password? Click here to reset