Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework

05/25/2021
by   Nirmalya Sen, et al.
6

The performance of speaker recognition system is highly dependent on the amount of speech used in enrollment and test. This work presents a detailed experimental review and analysis of the GMM-SVM based speaker recognition system in presence of duration variability. This article also reports a comparison of the performance of GMM-SVM classifier with its precursor technique Gaussian mixture model-universal background model (GMM-UBM) classifier in presence of duration variability. The goal of this research work is not to propose a new algorithm for improving speaker recognition performance in presence of duration variability. However, the main focus of this work is on utterance partitioning (UP), a commonly used strategy to compensate the duration variability issue. We have analysed in detailed the impact of training utterance partitioning in speaker recognition performance under GMM-SVM framework. We further investigate the reason why the utterance partitioning is important for boosting speaker recognition performance. We have also shown in which case the utterance partitioning could be useful and where not. Our study has revealed that utterance partitioning does not reduce the data imbalance problem of the GMM-SVM classifier as claimed in earlier study. Apart from these, we also discuss issues related to the impact of parameters such as number of Gaussians, supervector length, amount of splitting required for obtaining better performance in short and long duration test conditions from speech duration perspective. We have performed the experiments with telephone speech from POLYCOST corpus consisting of 130 speakers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2016

Incorporation of Speech Duration Information in Score Fusion of Speaker Recognition Systems

In recent years identity-vector (i-vector) based speaker verification (S...
research
08/26/2022

Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation

Many automatic speech recognition (ASR) data sets include a single pre-d...
research
02/03/2020

Within-sample variability-invariant loss for robust speaker recognition under noisy environments

Despite the significant improvements in speaker recognition enabled by d...
research
06/07/2022

The Influence of Dataset Partitioning on Dysfluency Detection Systems

This paper empirically investigates the influence of different data spli...
research
02/10/2023

Spoken language change detection inspired by speaker change detection

Spoken language change detection (LCD) refers to identifying the languag...
research
12/03/2018

Novel Quality Metric for Duration Variability Compensation in Speaker Verification using i-Vectors

Automatic speaker verification (ASV) is the process to recognize persons...
research
06/30/2023

VoxWatch: An open-set speaker recognition benchmark on VoxCeleb

Despite its broad practical applications such as in fraud prevention, op...

Please sign up or login with your details

Forgot password? Click here to reset