System Combination for Short Utterance Speaker Recognition

03/31/2016
by   Lantian Li, et al.
0

For text-independent short-utterance speaker recognition (SUSR), the performance often degrades dramatically. This paper presents a combination approach to the SUSR tasks with two phonetic-aware systems: one is the DNN-based i-vector system and the other is our recently proposed subregion-based GMM-UBM system. The former employs phone posteriors to construct an i-vector model in which the shared statistics offers stronger robustness against limited test data, while the latter establishes a phone-dependent GMM-UBM system which represents speaker characteristics with more details. A score-level fusion is implemented to integrate the respective advantages from the two systems. Experimental results show that for the text-independent SUSR task, both the DNN-based i-vector system and the subregion-based GMM-UBM system outperform their respective baselines, and the score-level system combination delivers performance improvement.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 6

research
05/24/2015

Deep Speaker Vectors for Semi Text-independent Speaker Verification

Recent research shows that deep neural networks (DNNs) can be used to ex...
research
06/28/2015

Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition

A deep learning approach has been proposed recently to derive speaker id...
research
03/28/2019

Deep Neural Network Embedding Learning with High-Order Statistics for Text-Independent Speaker Verification

The x-vector based deep neural network (DNN) embedding systems have demo...
research
03/28/2019

Multi-Task Learning with High-Order Statistics for X-vector based Text-Independent Speaker Verification

The x-vector based deep neural network (DNN) embedding systems have demo...
research
04/01/2018

I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification

I-vector based text-independent speaker verification (SV) systems often ...
research
05/15/2020

On Bottleneck Features for Text-Dependent Speaker Verification Using X-vectors

Applying x-vectors for speaker verification has recently attracted great...
research
06/18/2021

Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition

By implicitly recognizing a user based on his/her speech input, speaker ...

Please sign up or login with your details

Forgot password? Click here to reset