Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings

03/28/2022
by   Niko Brümmer, et al.

In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring backends are commonly used: cosine scoring and PLDA. Each has advantages and disadvantages, depending on the context. Cosine scoring follows naturally from the spherical geometry, but for PLDA length normalization is a mixed blessing: it Gaussianizes the between-speaker distribution, but violates the assumption of a speaker-independent within-speaker distribution. We propose PSDA, an analogue to PLDA that uses von Mises-Fisher distributions on the hypersphere for both the within-class and between-class distributions. We show how the self-conjugacy of this distribution gives closed-form likelihood-ratio scores, making it a drop-in replacement for PLDA at scoring time. All kinds of trials can be scored, including single-enroll and multi-enroll verification, as well as more complex likelihood ratios that could be used in clustering and diarization. Learning is done via an EM algorithm with closed-form updates. We explain the model and present some first experiments.
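
The closed-form scoring described above can be illustrated with a minimal sketch. It is not the authors' implementation, and it assumes the generative model the abstract implies: a hidden speaker direction z drawn from a von Mises-Fisher (VMF) distribution with mean direction mu and concentration b, and unit-length embeddings drawn from a VMF centred on z with concentration w. Under that assumption the hidden variable integrates out in closed form, and the log-likelihood-ratio reduces to four VMF log-normalizers. The function names (`log_bessel_i`, `log_vmf_norm`, `psda_llr`) and the parameter values are hypothetical, chosen only for illustration.

```python
import math


def log_bessel_i(nu, k, terms=500):
    """Log of the modified Bessel function I_nu(k) for k > 0.

    Uses the power series with a log-sum-exp for stability. The fixed number
    of terms is an assumption that suits small to moderate k only.
    """
    log_half_k = math.log(k / 2.0)
    logs = [
        (2 * m + nu) * log_half_k - math.lgamma(m + 1) - math.lgamma(m + nu + 1)
        for m in range(terms)
    ]
    top = max(logs)
    return top + math.log(sum(math.exp(v - top) for v in logs))


def log_vmf_norm(dim, k):
    """Log normalizer of a VMF density on the unit sphere in R^dim, k > 0."""
    nu = dim / 2.0 - 1.0
    return nu * math.log(k) - (dim / 2.0) * math.log(2.0 * math.pi) - log_bessel_i(nu, k)


def natural_param(mu, b, w, embeddings):
    """Natural parameter of the VMF posterior for the hidden speaker direction."""
    return [b * m + w * sum(x[i] for x in embeddings) for i, m in enumerate(mu)]


def norm(v):
    return math.sqrt(sum(c * c for c in v))


def psda_llr(mu, b, w, enroll, test):
    """Log-likelihood-ratio: same speaker versus different speakers.

    enroll and test are lists of unit-length embeddings (lists of floats).
    Assumes every natural parameter involved has non-zero norm.
    """
    dim = len(mu)
    e = natural_param(mu, b, w, enroll)
    t = natural_param(mu, b, w, test)
    both = natural_param(mu, b, w, enroll + test)
    return (
        log_vmf_norm(dim, norm(e))
        + log_vmf_norm(dim, norm(t))
        - log_vmf_norm(dim, norm(both))
        - log_vmf_norm(dim, b)
    )


# Toy usage in 3 dimensions with made-up parameters.
mu = [0.0, 0.0, 1.0]
x_a = [1.0, 0.0, 0.0]
x_b = [-1.0, 0.0, 0.0]
print(psda_llr(mu, 1.0, 10.0, [x_a], [x_a]))       # single-enroll, matching direction
print(psda_llr(mu, 1.0, 10.0, [x_a, x_a], [x_b]))  # multi-enroll, opposite direction
```

Multi-enroll trials need no special handling in this sketch: the enrollment embeddings are simply summed into the natural parameter. For realistic embedding dimensions and concentrations, a dedicated, numerically robust Bessel routine would be needed in place of the truncated series used here.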


research
10/27/2022

Toroidal Probabilistic Spherical Discriminant Analysis

In speaker recognition, where speech segments are mapped to embeddings o...
research
04/22/2022

Unifying Cosine and PLDA Back-ends for Speaker Verification

State-of-art speaker verification (SV) systems use a back-end model to s...
research
02/19/2023

Probabilistic Back-ends for Online Speaker Recognition and Clustering

This paper focuses on multi-enrollment speaker recognition which natural...
research
03/09/2018

Scoring Formulation for Multi-Condition Joint PLDA

The joint PLDA model is a generalization of PLDA where the nuisance var...
research
04/06/2020

Probabilistic embeddings for speaker diarization

Speaker embeddings (x-vectors) extracted from very short segments of spe...
research
02/27/2018

Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model

Embeddings in machine learning are low-dimensional representations of co...
research
04/25/2022

Back-ends Selection for Deep Speaker Embeddings

Probabilistic Linear Discriminant Analysis (PLDA) was the dominant and n...
