Robust speaker recognition using unsupervised adversarial invariance

11/03/2019
by   Raghuveer Peri, et al.
0

In this paper, we address the problem of speaker recognition in challenging acoustic conditions using a novel method to extract robust speaker-discriminative speech representations. We adopt a recently proposed unsupervised adversarial invariance architecture to train a network that maps speaker embeddings extracted using a pre-trained model onto two lower dimensional embedding spaces. The embedding spaces are learnt to disentangle speaker-discriminative information from all other information present in the audio recordings, without supervision about the acoustic conditions. We analyze the robustness of the proposed embeddings to various sources of variability present in the signal for speaker verification and unsupervised clustering tasks on a large-scale speaker recognition corpus. Our analyses show that the proposed system substantially outperforms the baseline in a variety of challenging acoustic scenarios. Furthermore, for the task of speaker diarization on a real-world meeting corpus, our system shows a relative improvement of 36% in the diarization error rate compared to the state-of-the-art baseline.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/07/2020

Learning Speaker Embedding with Momentum Contrast

Speaker verification can be formulated as a representation learning task...
research
05/05/2017

Deep Speaker: an End-to-End Neural Speaker Embedding System

We present Deep Speaker, a neural speaker embedding system that maps utt...
research
02/10/2020

An empirical analysis of information encoded in disentangled neural speaker representations

The primary characteristic of robust speaker representations is that the...
research
04/18/2018

Unspeech: Unsupervised Speech Context Embeddings

We introduce "Unspeech" embeddings, which are based on unsupervised lear...
research
08/05/2022

Robust Acoustic Domain Identification with its Application to Speaker Diarization

With the rise in multimedia content over the years, more variety is obse...
research
10/25/2019

Channel adversarial training for speaker verification and diarization

Previous work has encouraged domain-invariance in deep speaker embedding...
research
10/22/2019

Discriminative Neural Clustering for Speaker Diarisation

This paper proposes a novel method for supervised data clustering. The c...

Please sign up or login with your details

Forgot password? Click here to reset