Pairwise Discriminative Neural PLDA for Speaker Verification

01/20/2020
by Shreyas Ramoji, et al.
The state-of-the-art approach to speaker verification extracts discriminative embeddings such as x-vectors and scores them with a generative back-end based on probabilistic linear discriminant analysis (PLDA). In this paper, we propose a pairwise neural discriminative model for speaker verification that operates on a pair of speaker embeddings, such as x-vectors or i-vectors, and outputs a score that can be interpreted as a scaled log-likelihood ratio. We construct a differentiable cost function that approximates the speaker verification loss, namely the minimum detection cost. The pre-processing steps of linear discriminant analysis (LDA), unit-length normalization, and within-class covariance normalization are all modeled as layers of a neural model, so the speaker verification cost can be back-propagated through these layers during training. We also explore regularization techniques to prevent overfitting, which is a major concern when using discriminative back-end models for verification tasks. The experiments are performed on the NIST SRE 2018 development and evaluation datasets. We observe average relative improvements of 8 condition over the PLDA baseline system.
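
The abstract does not give the form of the pairwise score or of the differentiable cost, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes a PLDA-style quadratic score with hypothetical parameter matrices `P` and `Q`, and it replaces the hard accept/reject decision in the detection cost with a sigmoid so that the miss and false-alarm rates become differentiable in the scores. The names `alpha` (sigmoid sharpness) and `beta` (false-alarm weight) are illustrative choices.

```python
import numpy as np


def sigmoid(z):
    """Logistic function, used here as a smooth stand-in for a step function."""
    return 1.0 / (1.0 + np.exp(-z))


def pairwise_score(x_enroll, x_test, P, Q):
    """Quadratic score for a batch of embedding pairs (one pair per row).

    Assumed form: s = x_e^T Q x_e + x_t^T Q x_t + 2 x_e^T P x_t.
    P and Q are hypothetical learnable (d x d) matrices.
    """
    cross = 2.0 * np.einsum("nd,de,ne->n", x_enroll, P, x_test)
    within = (np.einsum("nd,de,ne->n", x_enroll, Q, x_enroll)
              + np.einsum("nd,de,ne->n", x_test, Q, x_test))
    return within + cross


def soft_detection_cost(scores, is_target, threshold, beta=1.0, alpha=10.0):
    """Smooth approximation of a detection cost P_miss + beta * P_fa.

    The hard decision (score > threshold) is replaced by
    sigmoid(alpha * (score - threshold)); larger alpha gives a closer
    approximation to the hard count. beta and alpha are assumptions.
    """
    scores = np.asarray(scores, dtype=float)
    is_target = np.asarray(is_target, dtype=float)
    accept = sigmoid(alpha * (scores - threshold))  # soft accept in (0, 1)
    p_miss = np.sum(is_target * (1.0 - accept)) / np.sum(is_target)
    p_fa = np.sum((1.0 - is_target) * accept) / np.sum(1.0 - is_target)
    return p_miss + beta * p_fa
```

For well-separated scores the soft cost approaches zero, and it approaches `1 + beta` when every trial is decided wrongly. In a training setup, the same expressions would be written in an automatic-differentiation framework so that the cost can be back-propagated into `P`, `Q`, the threshold, and any pre-processing layers.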
