A Study on Decoupled Probabilistic Linear Discriminant Analysis

11/24/2021
by   Di Wang, et al.
0

Probabilistic linear discriminant analysis (PLDA) has broad application in open-set verification tasks, such as speaker verification. A key concern for PLDA is that the model is too simple (linear Gaussian) to deal with complicated data; however, the simplicity by itself is a major advantage of PLDA, as it leads to desirable generalization. An interesting research therefore is how to improve modeling capacity of PLDA while retaining the simplicity. This paper presents a decoupling approach, which involves a global model that is simple and generalizable, and a local model that is complex and expressive. While the global model holds a bird view on the entire data, the local model represents the details of individual classes. We conduct a preliminary study towards this direction and investigate a simple decoupling model including both the global and local models. The new model, which we call decoupled PLDA, is tested on a speaker verification task. Experimental results show that it consistently outperforms the vanilla PLDA when the model is based on raw speaker vectors. However, when the speaker vectors are processed by length normalization, the advantage of decoupled PLDA will be largely lost, suggesting future research on non-linear local models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/20/2020

Pairwise Discriminative Neural PLDA for Speaker Verification

The state-of-art approach to speaker verification involves the extractio...
research
11/24/2021

An MAP Estimation for Between-Class Variance

Probabilistic linear discriminant analysis (PLDA) has been widely used i...
research
02/12/2018

Linear Regression for Speaker Verification

This paper presents a linear regression based back-end for speaker verif...
research
03/18/2015

Shared latent subspace modelling within Gaussian-Binary Restricted Boltzmann Machines for NIST i-Vector Challenge 2014

This paper presents a novel approach to speaker subspace modelling based...
research
11/04/2020

Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020

In this report, we describe our submission to the VoxCeleb Speaker Recog...
research
09/29/2017

PLDA-Based Diarization of Telephone Conversations

This paper investigates the application of the probabilistic linear disc...
research
03/09/2022

An Environmental Feature Representation in I-vector Space for Room Verification and Metadata Estimation

This paper investigates the application of environmental feature represe...

Please sign up or login with your details

Forgot password? Click here to reset