Cross-lingual Speaker Verification with Deep Feature Learning

06/22/2017
by   Lantian Li, et al.
0

Existing speaker verification (SV) systems often suffer from performance degradation if there is any language mismatch between model training, speaker enrollment, and test. A major cause of this degradation is that most existing SV methods rely on a probabilistic model to infer the speaker factor, so any significant change on the distribution of the speech signal will impact the inference. Recently, we proposed a deep learning model that can learn how to extract the speaker factor by a deep neural network (DNN). By this feature learning, an SV system can be constructed with a very simple back-end model. In this paper, we investigate the robustness of the feature-based SV system in situations with language mismatch. Our experiments were conducted on a complex cross-lingual scenario, where the model training was in English, and the enrollment and test were in Chinese or Uyghur. The experiments demonstrated that the feature-based system outperformed the i-vector system with a large margin, particularly with language mismatch between enrollment and test.

READ FULL TEXT
research
10/18/2021

Tackling the Score Shift in Cross-Lingual Speaker Verification by Exploiting Language Information

This paper contains a post-challenge performance analysis on cross-lingu...
research
08/11/2020

Why Did the x-Vector System Miss a Target Speaker? Impact of Acoustic Mismatch Upon Target Score on VoxCeleb Data

Modern automatic speaker verification (ASV) relies heavily on machine le...
research
04/08/2020

Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification

Speaker verification systems usually suffer from the mismatch problem be...
research
10/14/2021

Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech

In this paper, we present a FastPitch-based non-autoregressive cross-lin...
research
07/10/2018

Two-stage iterative Procrustes match algorithm and its application for VQ-based speaker verification

In the past decades, Vector Quantization (VQ) model has been very popula...
research
06/22/2017

Deep Speaker Verification: Do We Need End to End?

End-to-end learning treats the entire system as a whole adaptable black ...
research
12/23/2020

A Principle Solution for Enroll-Test Mismatch in Speaker Recognition

Mismatch between enrollment and test conditions causes serious performan...

Please sign up or login with your details

Forgot password? Click here to reset