STC speaker recognition systems for the NIST SRE 2021

11/03/2021
by Anastasia Avdeeva, et al.
This paper describes the STC Ltd. systems submitted to the NIST 2021 Speaker Recognition Evaluation for both the fixed and open training conditions. These systems consist of a number of diverse subsystems that use deep neural networks as feature extractors. During the NIST 2021 SRE challenge we focused on training state-of-the-art deep speaker embedding extractors, such as ResNets and ECAPA networks, with additive angular margin based loss functions. Additionally, inspired by the recent success of wav2vec 2.0 features in automatic speech recognition, we explored the effectiveness of this approach in the speaker verification field. According to our observations, fine-tuning the pretrained large wav2vec 2.0 model provides our best-performing systems for the open track condition. Our experiments with wav2vec 2.0 based extractors for the fixed condition showed that unsupervised autoregressive pretraining with a Contrastive Predictive Coding loss opens the door to training powerful transformer-based extractors from raw speech signals. For the video modality we developed our best solution with the RetinaFace face detector and a deep ResNet face embedding extractor trained on large face image datasets. The final results for the primary systems were obtained by score-level fusion of different configurations of subsystems, followed by score calibration.
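
As a rough illustration of the additive angular margin idea mentioned in the abstract, the sketch below computes the loss for a single embedding in plain Python. It is not the authors' implementation: the function name `aam_softmax_loss` and the `margin` and `scale` values are assumptions chosen for the example, and the abstract does not state the settings actually used. The margin is added to the angle between the embedding and its target class weight before the scaled softmax, which makes the target class harder to score well on and so encourages tighter speaker clusters.

```python
import math


def aam_softmax_loss(embedding, class_weights, target, margin=0.2, scale=30.0):
    """Additive angular margin softmax loss for one example.

    margin and scale are illustrative defaults, not values from the paper.
    """

    def l2_normalize(vec):
        norm = math.sqrt(sum(x * x for x in vec))
        return [x / norm for x in vec]

    emb = l2_normalize(embedding)
    logits = []
    for k, weight in enumerate(class_weights):
        w = l2_normalize(weight)
        # Cosine of the angle between the embedding and the class weight.
        cos = sum(a * b for a, b in zip(emb, w))
        cos = max(-1.0, min(1.0, cos))  # guard acos against rounding error
        if k == target:
            # Penalize the target class: cos(theta + margin) instead of cos(theta).
            # Simplification: no special handling when theta + margin exceeds pi.
            cos = math.cos(math.acos(cos) + margin)
        logits.append(scale * cos)

    # Cross-entropy via a numerically stable log-sum-exp.
    top = max(logits)
    log_z = top + math.log(sum(math.exp(z - top) for z in logits))
    return log_z - logits[target]


# Toy example: a 2-D embedding halfway between two class directions.
weights = [[1.0, 0.0], [0.0, 1.0]]
plain = aam_softmax_loss([1.0, 1.0], weights, target=0, margin=0.0)
with_margin = aam_softmax_loss([1.0, 1.0], weights, target=0, margin=0.2)
```

With `margin=0.0` the function reduces to an ordinary scaled cosine softmax, so comparing `plain` with `with_margin` shows how the margin raises the loss for the same input.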

research
03/28/2022

Robust Speaker Recognition with Transformers Using wav2vec 2.0

Recent advances in unsupervised speech representation learning discover ...
research
10/23/2020

The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description

In this technical report we describe the IDLAB top-scoring submissions f...
research
10/21/2020

The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification

In this paper we propose and analyse a large margin fine-tuning strategy...
research
07/13/2019

BUT VOiCES 2019 System Description

This is a description of our effort in VOiCES 2019 Speaker Recognition c...
research
10/31/2020

The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020

This report describes the systems submitted to the first and second trac...
research
10/16/2019

BUT System Description to VoxCeleb Speaker Recognition Challenge 2019

In this report, we describe the submission of Brno University of Technol...
research
02/02/2021

A Speaker Verification Backend with Robust Performance across Conditions

In this paper, we address the problem of speaker verification in conditi...
