Denoising Deep Neural Networks Based Voice Activity Detection

03/04/2013
by Xiao-Lei Zhang, et al.

Recently, deep-belief-network (DBN) based voice activity detection (VAD) has been proposed. It is powerful in fusing the advantages of multiple features and achieves state-of-the-art performance. However, the deep layers of the DBN-based VAD do not show an apparent superiority over the shallower layers. In this paper, we propose a denoising-deep-neural-network (DDNN) based VAD to address this problem. Specifically, we pre-train a deep neural network in a special unsupervised, denoising, greedy layer-wise mode, and then fine-tune the whole network in a supervised way with the common back-propagation algorithm. In the pre-training phase, we take the noisy speech signals as the visible layer and try to extract a new feature that minimizes the reconstruction cross-entropy loss between the noisy speech signals and their corresponding clean speech signals. Experimental results show that the proposed DDNN-based VAD not only outperforms the DBN-based VAD but also shows an apparent performance improvement of the deep layers over the shallower layers.
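The denoising pre-training step described in the abstract can be illustrated with a minimal sketch of a single layer. It is not the authors' implementation: the synthetic features, the layer sizes, the learning rate, and the helper names (`pretrain_denoising_layer`, `cross_entropy`) are all assumptions made for illustration, and the stacking of further layers and the supervised fine-tuning stage are omitted. The sketch only shows the core idea of feeding noisy input to the visible layer while measuring the reconstruction cross-entropy against the clean signal.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cross_entropy(target, recon, eps=1e-9):
    # Per-example sum of binary cross-entropy terms, averaged over examples.
    recon = np.clip(recon, eps, 1.0 - eps)
    per_dim = target * np.log(recon) + (1.0 - target) * np.log(1.0 - recon)
    return -np.mean(np.sum(per_dim, axis=1))

def pretrain_denoising_layer(noisy, clean, n_hidden, epochs=200, lr=0.5, seed=0):
    """Train one encoder/decoder pair: noisy input in, clean target out."""
    rng = np.random.default_rng(seed)
    n, d = noisy.shape
    W1 = rng.normal(0.0, 0.1, (d, n_hidden)); b1 = np.zeros(n_hidden)
    W2 = rng.normal(0.0, 0.1, (n_hidden, d)); b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        h = sigmoid(noisy @ W1 + b1)        # hidden feature from the noisy input
        r = sigmoid(h @ W2 + b2)            # reconstruction of the clean signal
        losses.append(cross_entropy(clean, r))
        # For a sigmoid output with cross-entropy loss, the gradient with
        # respect to the output pre-activation is (r - clean).
        d_out = (r - clean) / n
        d_hid = (d_out @ W2.T) * h * (1.0 - h)
        W2 -= lr * (h.T @ d_out);     b2 -= lr * d_out.sum(axis=0)
        W1 -= lr * (noisy.T @ d_hid); b1 -= lr * d_hid.sum(axis=0)
    return (W1, b1), losses

# Synthetic stand-in for acoustic features scaled to [0, 1] (an assumption;
# the paper's actual features are not reproduced here).
rng = np.random.default_rng(1)
latent = rng.normal(size=(256, 4))
clean = sigmoid(3.0 * latent @ rng.normal(size=(4, 20)))
noisy = np.clip(clean + 0.2 * rng.normal(size=clean.shape), 0.0, 1.0)

encoder, losses = pretrain_denoising_layer(noisy, clean, n_hidden=8)
print(f"reconstruction loss: {losses[0]:.3f} -> {losses[-1]:.3f}")
```

In a greedy layer-wise scheme, the trained encoder weights would initialize one layer of the deep network before the next layer is pre-trained; how the paper forms the targets for deeper layers is not stated in the abstract, so that step is left out here.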

