A Principle Solution for Enroll-Test Mismatch in Speaker Recognition

12/23/2020
by   Lantian Li, et al.
0

Mismatch between enrollment and test conditions causes serious performance degradation on speaker recognition systems. This paper presents a statistics decomposition (SD) approach to solve this problem. This approach is based on the normalized likelihood (NL) scoring framework, and is theoretically optimal if the statistics on both the enrollment and test conditions are accurate. A comprehensive experimental study was conducted on three datasets with different types of mismatch: (1) physical channel mismatch, (2) speaking style mismatch, (3) near-far recording mismatch. The results demonstrated that the proposed SD approach is highly effective, and outperforms the ad-hoc multi-condition training approach that is commonly adopted but not optimal in theory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/27/2020

Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification

Domain mismatch often occurs in real applications and causes serious per...
research
01/23/2019

Spherical sampling methods for the calculation of metamer mismatch volumes

In this paper, we propose two methods of calculating theoretically maxim...
research
12/23/2020

CN-Celeb: multi-genre speaker recognition

Research on speaker recognition is extending to address the vulnerabilit...
research
04/01/2022

Speaker verification in mismatch training and testing conditions

This paper presents an exhaustive study about the robustness of several ...
research
06/22/2017

Cross-lingual Speaker Verification with Deep Feature Learning

Existing speaker verification (SV) systems often suffer from performance...
research
02/24/2022

A comparative study of several parameterizations for speaker recognition

This paper presents an exhaustive study about the robustness of several ...
research
08/11/2020

Why Did the x-Vector System Miss a Target Speaker? Impact of Acoustic Mismatch Upon Target Score on VoxCeleb Data

Modern automatic speaker verification (ASV) relies heavily on machine le...

Please sign up or login with your details

Forgot password? Click here to reset