Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings

07/13/2022
by   Xiaoyi Qin, et al.
0

Automatic speaker verification has achieved remarkable progress in recent years. However, there is little research on cross-age speaker verification (CASV) due to insufficient relevant data. In this paper, we mine cross-age test sets based on the VoxCeleb dataset and propose our age-invariant speaker representation(AISR) learning method. Since the VoxCeleb is collected from the YouTube platform, the dataset consists of cross-age data inherently. However, the meta-data does not contain the speaker age label. Therefore, we adopt the face age estimation method to predict the speaker age value from the associated visual data, then label the audio recording with the estimated age. We construct multiple Cross-Age test sets on VoxCeleb (Vox-CA), which deliberately select the positive trials with large age-gap. Also, the effect of nationality and gender is considered in selecting negative pairs to align with Vox-H cases. The baseline system performance drops from 1.939% EER on the Vox-H test set to 10.419% on the Vox-CA20 test set, which indicates how difficult the cross-age scenario is. Consequently, we propose an age-decoupling adversarial learning (ADAL) method to alleviate the negative effect of the age gap and reduce intra-class variance. Our method outperforms the baseline system by over 10% related EER reduction on the Vox-CA20 test set. The source code and trial resources are available on https://github.com/qinxiaoyi/Cross-Age_Speaker_Verification

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/23/2022

Towards Speaker Age Estimation with Label Distribution Learning

Existing methods for speaker age estimation usually treat it as a multi-...
research
04/07/2023

Margin-Mixup: A Method for Robust Speaker Verification in Multi-Speaker Audio

This paper is concerned with the task of speaker verification on audio w...
research
11/10/2021

Inclusive Speaker Verification with Adaptive thresholding

While using a speaker verification (SV) based system in a commercial app...
research
08/28/2017

Cross-Age LFW: A Database for Studying Cross-Age Face Recognition in Unconstrained Environments

Labeled Faces in the Wild (LFW) database has been widely utilized as the...
research
10/28/2022

Laugh Betrays You? Learning Robust Speaker Representation From Speech Containing Non-Verbal Fragments

The success of automatic speaker verification shows that discriminative ...
research
03/06/2022

C-P Map: A Novel Evaluation Toolkit for Speaker Verification

Evaluation trials are used to probe performance of automatic speaker ver...
research
10/16/2021

Face Verification with Challenging Imposters and Diversified Demographics

Face verification aims to distinguish between genuine and imposter pairs...

Please sign up or login with your details

Forgot password? Click here to reset