VB calibration to improve the interface between phone recognizer and i-vector extractor

10/12/2015
by Niko Brümmer, et al.

The EM training algorithm of the classical i-vector extractor is often incorrectly described as a maximum-likelihood method. The i-vector model is, however, intractable: neither the likelihood itself nor the hidden-variable posteriors needed for the EM algorithm can be computed in closed form. We show here that the classical i-vector extractor recipe is actually a mean-field variational Bayes (VB) recipe. This theoretical VB interpretation turns out to be of further use, because it also offers an interpretation of the newer phonetic i-vector extractor recipe, thereby unifying the two flavours of extractor. More importantly, the VB interpretation is also practically useful: it suggests ways of modifying existing i-vector extractors to make them more accurate. In particular, in existing methods, the approximate VB posterior for the GMM states is fixed, while only the parameters of the generative model are adapted. Here we explore the possibility of also mildly adjusting (calibrating) those posteriors, so that they better fit the generative model.
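To make the idea of "mildly adjusting" the GMM state posteriors concrete, here is a minimal sketch in Python. The abstract does not state how the calibration is parameterized, so the form used here is an assumption: an affine transform of the per-frame log-posteriors followed by renormalization. The names `calibrate_posteriors`, `scale`, `offsets` and `zero_order_stats` are hypothetical and do not come from the paper.

```python
import math


def calibrate_posteriors(posteriors, scale=1.0, offsets=None):
    """Affinely transform per-frame log-posteriors, then renormalize.

    posteriors: list of frames, each a list of state posteriors summing to 1.
    scale, offsets: hypothetical calibration parameters (an assumed form,
    not taken from the paper). scale=1 with zero offsets is the identity.
    """
    n_states = len(posteriors[0])
    if offsets is None:
        offsets = [0.0] * n_states
    calibrated = []
    for frame in posteriors:
        # Floor the posterior to avoid log(0) for states with zero mass.
        logits = [scale * math.log(max(p, 1e-300)) + b
                  for p, b in zip(frame, offsets)]
        peak = max(logits)  # subtract the max for numerical stability
        weights = [math.exp(value - peak) for value in logits]
        total = sum(weights)
        calibrated.append([w / total for w in weights])
    return calibrated


def zero_order_stats(posteriors):
    """Sum state posteriors over frames (soft occupation counts per state)."""
    return [sum(column) for column in zip(*posteriors)]


# Two frames, two states, e.g. as produced by a phone recognizer or a UBM.
posts = [[0.9, 0.1], [0.2, 0.8]]
identity = calibrate_posteriors(posts)              # posteriors left as they are
softened = calibrate_posteriors(posts, scale=0.5)   # posteriors flattened
```

In this sketch, a scale below 1 flattens the posteriors and a scale above 1 sharpens them. The calibrated posteriors would then replace the fixed ones when accumulating the statistics that feed the i-vector extractor; how the calibration parameters are chosen is not covered here.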


