The SpeakIn Speaker Verification System for Far-Field Speaker Verification Challenge 2022

09/23/2022
by   Yu Zheng, et al.
0

This paper describes speaker verification (SV) systems submitted by the SpeakIn team to the Task 1 and Task 2 of the Far-Field Speaker Verification Challenge 2022 (FFSVC2022). SV tasks of the challenge focus on the problem of fully supervised far-field speaker verification (Task 1) and semi-supervised far-field speaker verification (Task 2). In Task 1, we used the VoxCeleb and FFSVC2020 datasets as train datasets. And for Task 2, we only used the VoxCeleb dataset as train set. The ResNet-based and RepVGG-based architectures were developed for this challenge. Global statistic pooling structure and MQMHA pooling structure were used to aggregate the frame-level features across time to obtain utterance-level representation. We adopted AM-Softmax and AAM-Softmax to classify the resulting embeddings. We innovatively propose a staged transfer learning method. In the pre-training stage we reserve the speaker weights, and there are no positive samples to train them in this stage. Then we fine-tune these weights with both positive and negative samples in the second stage. Compared with the traditional transfer learning strategy, this strategy can better improve the model performance. The Sub-Mean and AS-Norm backend methods were used to solve the problem of domain mismatch. In the fusion stage, three models were fused in Task1 and two models were fused in Task2. On the FFSVC2022 leaderboard, the EER of our submission is 3.0049 is 0.2938 in Task1. In Task2, EER and minDCF are 6.2060 respectively. Our approach leads to excellent performance and ranks 1st in both challenge tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/22/2022

The SpeakIn System Description for CNSRC2022

This report describes our speaker verification systems for the tasks of ...
research
08/08/2020

NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge

This paper describes the NPU system submitted to Interspeech 2020 Far-Fi...
research
10/12/2022

THUEE system description for NIST 2020 SRE CTS challenge

This paper presents the system description of the THUEE team for the NIS...
research
07/03/2021

The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge

This paper describes the systems submitted by team HCCL to the Far-Field...
research
02/26/2021

The NPU System for the 2020 Personalized Voice Trigger Challenge

This paper describes the system developed by the NPU team for the 2020 p...
research
05/01/2023

CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds

This paper describes the Ubenwa CryCeleb dataset - a labeled collection ...
research
06/17/2021

Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification

In far-field speaker verification, the performance of speaker embeddings...

Please sign up or login with your details

Forgot password? Click here to reset