Gaussian-Constrained training for speaker verification

11/08/2018
by   Lantian Li, et al.
0

Neural models, in particular the d-vector and x-vector architectures, have produced state-of-the-art performance on many speaker verification tasks. However, two potential problems of these neural models deserve more investigation. Firstly, both models suffer from `information leak', which means that some parameters participating in model training will be discarded during inference, i.e, the layers that are used as the classifier. Secondly, both models do not regulate the distribution of the derived speaker vectors. This `unconstrained distribution' may degrade the performance of the subsequent scoring component, e.g., PLDA. This paper proposes a Gaussian-constrained training approach that (1) discards the parametric classifier, and (2) enforces the distribution of the derived speaker vectors to be Gaussian. Our experiments on the VoxCeleb and SITW databases demonstrated that this new training approach produced more representative and regular speaker embeddings, leading to consistent performance improvement.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2017

Speaker Diarization with LSTM

For many years, i-vector based speaker embedding techniques were the dom...
research
04/07/2020

Deep Normalization for Speaker Vectors

Deep speaker embedding has demonstrated state-of-the-art performance in ...
research
10/31/2017

Full-info Training for Deep Speaker Feature Learning

In recent studies, it has shown that speaker patterns can be learned fro...
research
03/22/2022

Speaker recognition with a MLP classifier and LPCC codebook

This paper improves the speaker recognition rates of a MLP classifier an...
research
04/07/2019

VAE-based regularization for deep speaker embedding

Deep speaker embedding has achieved state-of-the-art performance in spea...
research
10/30/2020

Deep Speaker Vector Normalization with Maximum Gaussianality Training

Deep speaker embedding represents the state-of-the-art technique for spe...
research
11/24/2021

An MAP Estimation for Between-Class Variance

Probabilistic linear discriminant analysis (PLDA) has been widely used i...

Please sign up or login with your details

Forgot password? Click here to reset