Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System

04/14/2021
by   Ju-ho Kim, et al.

Most speaker verification tasks are studied under an open-set evaluation scenario to reflect real-world conditions. Thus, the ability to generalize to unseen speakers is of paramount importance to the performance of a speaker verification system. We propose to apply Mean Teacher, a temporal averaging model, to extract speaker embeddings with small intra-class variance and large inter-class variance. The mean teacher network is obtained by temporally averaging the parameters of a deep neural network over training; it can produce more accurate and stable representations than the weights obtained at the end of training. By learning the reliable intermediate representation of the mean teacher network, we expect that the proposed method can explore more discriminative embedding spaces and improve the generalization performance of the speaker verification system. Experimental results on the VoxCeleb1 test set demonstrate that the proposed method achieves a relative performance improvement of 11.61% over a baseline system.
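As a rough illustration of the temporal averaging described in the abstract, the sketch below keeps a teacher copy of the weights as an exponential moving average of the student weights, which is the usual form of the Mean Teacher update. The function name `ema_update`, the decay `alpha`, and the toy weight values are assumptions made for this example and are not taken from the paper. The loss that ties the student to the teacher's intermediate representation is not shown.

```python
def ema_update(teacher, student, alpha=0.99):
    """Return teacher weights moved toward the student weights.

    Hypothetical helper: each teacher weight becomes
    alpha * teacher + (1 - alpha) * student.
    """
    return [alpha * t + (1.0 - alpha) * s for t, s in zip(teacher, student)]


# Toy run (made-up numbers): the student weights oscillate between two
# points, while the teacher tracks a smoothed average of them.
student_history = [[1.0, -1.0], [3.0, 1.0], [1.0, -1.0], [3.0, 1.0]]

teacher = list(student_history[0])  # initialise the teacher from the student
for student in student_history[1:]:
    # alpha=0.5 is chosen only to make the arithmetic easy to follow
    teacher = ema_update(teacher, student, alpha=0.5)

print(teacher)
```

In practice the decay is typically set close to 1, so the teacher changes slowly and smooths out step-to-step noise in the student weights.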


research
05/07/2020

Crop Aggregating for short utterances speaker verification using raw waveforms

Most studies on speaker verification systems focus on long-duration utte...
research
05/07/2020

Segment Aggregation for short utterances speaker verification using raw waveforms

Most studies on speaker verification systems focus on long-duration utte...
research
03/20/2020

Improving Embedding Extraction for Speaker Verification with Ladder Network

Speaker verification is an established yet challenging task in speech pr...
research
03/02/2023

Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification

Deep speaker models yield low error rates in speaker verification. Nonet...
research
11/24/2021

An MAP Estimation for Between-Class Variance

Probabilistic linear discriminant analysis (PLDA) has been widely used i...
research
06/17/2021

Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification

In far-field speaker verification, the performance of speaker embeddings...
research
02/10/2022

Learnable Nonlinear Compression for Robust Speaker Verification

In this study, we focus on nonlinear compression methods in spectral fea...
