On the use of DNN Autoencoder for Robust Speaker Recognition

11/07/2018
by   Ondrej Novotny, et al.
0

In this paper, we present an analysis of a DNN-based autoencoder for speech enhancement, dereverberation and denoising. The target application is a robust speaker recognition system. We started with augmenting the Fisher database with artificially noised and reverberated data and we trained the autoencoder to map noisy and reverberated speech to its clean version. We use the autoencoder as a preprocessing step for a state-of-the-art text-independent speaker recognition system. We compare results achieved with pure autoencoder enhancement, multi-condition PLDA training and their simultaneous use. We present a detailed analysis with various conditions of NIST SRE 2010, PRISM and artificially corrupted NIST SRE 2010 telephone condition. We conclude that the proposed preprocessing significantly outperforms the baseline and that this technique can be used to build a robust speaker recognition system for reverberated and noisy data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/19/2018

Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition

In this work, we present an analysis of a DNN-based autoencoder for spee...
research
03/13/2020

End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification

Speech 'in-the-wild' is a handicap for speaker recognition systems due t...
research
11/02/2022

Analysis of Noisy-target Training for DNN-based speech enhancement

Deep neural network (DNN)-based speech enhancement usually uses a clean ...
research
12/01/2022

Deep neural network techniques for monaural speech enhancement: state of the art analysis

Deep neural networks (DNN) techniques have become pervasive in domains s...
research
11/14/2022

Multi-Label Training for Text-Independent Speaker Identification

In this paper, we propose a novel strategy for text-independent speaker ...
research
01/06/2020

Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders

Deep learning-based models have greatly advanced the performance of spee...
research
03/23/2018

Exploring the robustness of features and enhancement on speech recognition systems in highly-reverberant real environments

This paper evaluates the robustness of a DNN-HMM-based speech recognitio...

Please sign up or login with your details

Forgot password? Click here to reset