Parameterized Channel Normalization for Far-field Deep Speaker Verification

09/24/2021
by   Xuechen Liu, et al.

We address far-field speaker verification with a deep neural network (DNN) based speaker embedding extractor, where mismatch between enrollment and test data often comes from convolutive effects (e.g., room reverberation) and noise. To mitigate these effects, we focus on two parametric normalization methods: per-channel energy normalization (PCEN) and parameterized cepstral mean normalization (PCMN). Both methods contain differentiable parameters and can therefore be integrated into, and jointly optimized with, the DNN using automatic differentiation. We consider both fixed and trainable (data-driven) variants of each method. We evaluate performance on Hi-MIA, a recent large-scale far-field speech corpus with varied microphone and positional settings. Our methods outperform conventional mel filterbank features, with maximum of 33.5 rate under matched microphone and mismatched microphone conditions, respectively.
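To make the idea of PCEN concrete, the following is a minimal sketch of its fixed (non-trainable) variant as it is commonly formulated in the PCEN literature: a per-channel smoothed energy M is used for adaptive gain control, followed by root compression. The function name `pcen`, the pure-Python list representation, the smoother initialisation, and the default values for `alpha`, `delta`, `r`, `s`, and `eps` are assumptions made for illustration; they are not taken from the paper and may differ from the authors' configuration.

```python
def pcen(energies, alpha=0.98, delta=2.0, r=0.5, s=0.025, eps=1e-6):
    """Apply fixed-parameter PCEN to a list of frames.

    energies: list of frames, each a list of non-negative per-channel
              (e.g. mel filterbank) energies.
    All default parameter values are illustrative assumptions.
    """
    if not energies:
        return []
    # Assumption: start the smoother from the first frame's energies.
    smoothed = list(energies[0])
    output = []
    for frame in energies:
        # First-order IIR smoother per channel:
        #   M[t] = (1 - s) * M[t-1] + s * E[t]
        smoothed = [(1.0 - s) * m + s * e for m, e in zip(smoothed, frame)]
        # Adaptive gain control (divide by M^alpha), then root
        # compression with offset delta:
        #   PCEN = (E / (eps + M)^alpha + delta)^r - delta^r
        output.append([
            (e / (eps + m) ** alpha + delta) ** r - delta ** r
            for e, m in zip(frame, smoothed)
        ])
    return output


# Usage: two channels, three frames of toy energies.
frames = [[1.0, 4.0], [2.0, 3.0], [1.5, 5.0]]
normalized = pcen(frames)
```

With `alpha=1.0` and a very small `eps`, multiplying every energy in a channel by a constant gain leaves the output almost unchanged, which gives some intuition for why this kind of normalization can reduce stationary channel mismatch. In a trainable variant, parameters such as `alpha`, `delta`, and `r` would be held as tensors in an automatic differentiation framework and updated together with the DNN weights.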

