On Bottleneck Features for Text-Dependent Speaker Verification Using X-vectors

05/15/2020
by Achintya Kumar Sarkar, et al.

Applying x-vectors to speaker verification has recently attracted great interest, with the focus being on text-independent speaker verification. In this paper, we study x-vectors for text-dependent speaker verification (TD-SV), which remains unexplored. We further investigate the impact of different bottleneck (BN) features on the performance of x-vectors, including the recently introduced time-contrastive-learning (TCL) BN features and phone-discriminant BN features. TCL is a weakly supervised learning approach that constructs training data by uniformly partitioning each utterance into a predefined number of segments and then assigning each segment a class label according to its position in the utterance. We also compare TD-SV performance across modeling techniques, including the Gaussian mixture model-universal background model (GMM-UBM), i-vector, and x-vector. Experiments are conducted on the RedDots 2016 challenge database. We find that the feature type has only a marginal impact on x-vector performance, with the TCL BN feature achieving the lowest equal error rate, whereas the impact of features is significant for i-vector and GMM-UBM. Fusing the x-vector and i-vector systems gives a large gain in performance. The GMM-UBM technique shows its advantage for TD-SV with short utterances.
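The TCL labeling step described in the abstract can be illustrated with a minimal sketch. The function name `tcl_labels`, the frame-level granularity, and the rule for handling utterance lengths that do not divide evenly are assumptions made for illustration; the abstract does not specify them, and the number of segments used in the paper is not stated here.

```python
from typing import List


def tcl_labels(num_frames: int, num_segments: int) -> List[int]:
    """Return one class label per frame for a single utterance.

    The utterance is split into `num_segments` near-equal contiguous
    segments, and every frame receives the index of the segment it
    falls in (0 for the first segment, num_segments - 1 for the last).
    Hypothetical helper: the remainder handling is an assumption.
    """
    if num_frames <= 0 or num_segments <= 0:
        raise ValueError("num_frames and num_segments must be positive")
    # Frame t lies in segment floor(t * num_segments / num_frames).
    return [t * num_segments // num_frames for t in range(num_frames)]


if __name__ == "__main__":
    # A 10-frame utterance split into 4 position-based classes.
    print(tcl_labels(10, 4))
```

Because the labels depend only on position within the utterance, no phonetic or speaker annotation is needed, which is what makes the approach weakly supervised. Utterances with fewer frames than segments would leave some classes unused under this sketch.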


