The HCCL System for the NIST SRE21

07/11/2022
by   Zhuo Li, et al.
0

This paper describes the systems developed by the HCCL team for the NIST 2021 speaker recognition evaluation (NIST SRE21).We first explore various state-of-the-art speaker embedding extractors combined with a novel circle loss to obtain discriminative deep speaker embeddings. Considering that cross-channel and cross-linguistic speaker recognition are the key challenges of SRE21, we introduce several techniques to reduce the cross-domain mismatch. Specifically, Codec and speech enhancement are directly applied to the raw speech to eliminate the codecs and the environment noise mismatch. We denote the methods that work directly on speech to eliminate the relatively explicit mismatches collectively as data adaptation methods. Experiments show that data adaption methods achieve 15% improvements over our baseline. Furthermore, some popular back-ends domain adaptation algorithms are deployed on speaker embeddings to alleviate speaker performance degradation caused by the implicit mismatch. Score calibration is a major failure for us in SRE21. The reason is that score calibration with too many parameters easily lead to overfitting problems.

READ FULL TEXT
research
11/17/2020

Adversarial Training for Multi-domain Speaker Recognition

In real-life applications, the performance of speaker recognition system...
research
10/24/2016

UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation

This document briefly describes the systems submitted by the Center for ...
research
09/18/2023

Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders

We propose a novel framework for electrolaryngeal speech intelligibility...
research
10/27/2020

Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification

Domain mismatch often occurs in real applications and causes serious per...
research
09/05/2020

Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification

Forensic audio analysis for speaker verification offers unique challenge...
research
11/06/2019

The Speed Submission to DIHARD II: Contributions Lessons Learned

This paper describes the speaker diarization systems developed for the S...
research
02/25/2019

Channel adversarial training for cross-channel text-independent speaker recognition

The conventional speaker recognition frameworks (e.g., the i-vector and ...

Please sign up or login with your details

Forgot password? Click here to reset