Disentangled dimensionality reduction for noise-robust speaker diarisation

10/07/2021
by   You Jin Kim, et al.
0

The objective of this work is to train noise-robust speaker embeddings for speaker diarisation. Speaker embeddings play a crucial role in the performance of diarisation systems, but they often capture spurious information such as noise and reverberation, adversely affecting performance. Our previous work have proposed an auto-encoder-based dimensionality reduction module to help remove the spurious information. However, they do not explicitly separate such information and have also been found to be sensitive to hyperparameter values. To this end, we propose two contributions to overcome these issues: (i) a novel dimensionality reduction framework that can disentangle spurious information from the speaker embeddings; (ii) the use of a speech/non-speech indicator to prevent the speaker code from learning from the background noise. Through a range of experiments conducted on four different datasets, our approach consistently demonstrates the state-of-the-art performance among models that do not adopt ensembles.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2021

Adapting Speaker Embeddings for Speaker Diarisation

The goal of this paper is to adapt speaker embeddings for solving the pr...
research
07/24/2021

Significance of Speaker Embeddings and Temporal Context for Depression Detection

Depression detection from speech has attracted a lot of attention in rec...
research
11/07/2018

Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification

This article presents a novel approach for learning domain-invariant spe...
research
02/10/2021

ABSP System for The Third DIHARD Challenge

This report describes the speaker diarization system developed by the AB...
research
08/04/2020

Intra-class variation reduction of speaker representation in disentanglement framework

In this paper, we propose an effective training strategy to ex-tract rob...
research
01/25/2021

Domain-Dependent Speaker Diarization for the Third DIHARD Challenge

This report presents the system developed by the ABSP Laboratory team fo...
research
06/20/2022

GiDR-DUN; Gradient Dimensionality Reduction – Differences and Unification

TSNE and UMAP are two of the most popular dimensionality reduction algor...

Please sign up or login with your details

Forgot password? Click here to reset