Deep Speaker Feature Learning for Text-independent Speaker Verification

05/10/2017
by   Lantian Li, et al.
0

Recently deep neural networks (DNNs) have been used to learn speaker features. However, the quality of the learned features is not sufficiently good, so a complex back-end model, either neural or probabilistic, has to be used to address the residual uncertainty when applied to speaker verification, just as with raw features. This paper presents a convolutional time-delay deep neural network structure (CT-DNN) for speaker feature learning. Our experimental results on the Fisher database demonstrated that this CT-DNN can produce high-quality speaker features: even with a single feature (0.3 seconds including the context), the EER can be as low as 7.68 confirmed that the speaker trait is largely a deterministic short-time property rather than a long-time distributional pattern, and therefore can be extracted from just dozens of frames.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/31/2017

Full-info Training for Deep Speaker Feature Learning

In recent studies, it has shown that speaker patterns can be learned fro...
research
06/22/2017

Deep Speaker Verification: Do We Need End to End?

End-to-end learning treats the entire system as a whole adaptable black ...
research
07/30/2020

A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings

Modern automatic speaker verification relies largely on deep neural netw...
research
09/24/2021

Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification

After their introduction to robust speech recognition, power normalized ...
research
09/08/2018

Dual-label Deep LSTM Dereverberation For Speaker Verification

In this paper, we present a reverberation removal approach for speaker v...
research
08/17/2020

Bayesian deep learning for mapping via auxiliary information: a new era for geostatistics?

For geospatial modelling and mapping tasks, variants of kriging - the sp...
research
05/19/2018

Sparse Architectures for Text-Independent Speaker Verification Using Deep Neural Networks

Network pruning is of great importance due to the elimination of the uni...

Please sign up or login with your details

Forgot password? Click here to reset