Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism

05/21/2020
by Wang Dai, et al.

Formant tracking is one of the most fundamental problems in speech processing. Traditionally, formants are estimated using signal processing methods. Recent studies have shown that generic convolutional architectures can outperform recurrent networks on temporal tasks such as speech synthesis and machine translation. In this paper, we explored the use of a Temporal Convolutional Network (TCN) for formant tracking. In addition to the conventional implementation, we modified the architecture in three respects. First, we turned off the "causal" mode of the dilated convolution, so that it can also see future speech frames. Second, each hidden layer reused the outputs of all previous layers through dense connections. Third, we adopted a gating mechanism to alleviate the vanishing gradient problem by selectively forgetting unimportant information. The model was validated on the open-access formant database VTR. The experiments showed that the proposed model converged easily and achieved an overall mean absolute percent error (MAPE) of 8.2, compared to three competitive baselines of 9.4
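
The three modifications described in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' implementation. The function names (`dilated_conv1d`, `gated_layer`, `dense_gated_tcn`), the random weights, the dilation schedule of 1, 2, 4, and the growth of one channel per layer are all assumptions made for illustration. The gate is assumed to be a WaveNet-style `tanh(filter) * sigmoid(gate)` product, because the abstract does not specify its exact form.

```python
import math
import random


def dilated_conv1d(x, weights, dilation, causal=False):
    """Single-channel dilated convolution with zero padding (output length == input length).

    causal=False centres the kernel on frame t, so the output at t depends on
    both past and future frames. causal=True only looks at frame t and earlier.
    """
    T, K = len(x), len(weights)
    out = []
    for t in range(T):
        acc = 0.0
        for k in range(K):
            if causal:
                idx = t - (K - 1 - k) * dilation
            else:
                idx = t + (k - (K - 1) // 2) * dilation
            if 0 <= idx < T:  # frames outside the signal count as zeros
                acc += weights[k] * x[idx]
        out.append(acc)
    return out


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def gated_layer(channels, filt_w, gate_w, dilation, causal=False):
    """Assumed gate: tanh(filter branch) * sigmoid(gate branch), summed over input channels."""
    T = len(channels[0])
    f, g = [0.0] * T, [0.0] * T
    for ch, fw, gw in zip(channels, filt_w, gate_w):
        for t, v in enumerate(dilated_conv1d(ch, fw, dilation, causal)):
            f[t] += v
        for t, v in enumerate(dilated_conv1d(ch, gw, dilation, causal)):
            g[t] += v
    return [math.tanh(a) * sigmoid(b) for a, b in zip(f, g)]


def dense_gated_tcn(x, num_layers=3, kernel_size=3, seed=0):
    """Dense connection: every layer reads the input plus all earlier layer outputs."""
    rng = random.Random(seed)  # illustrative random weights, not trained values
    features = [list(x)]
    for layer in range(num_layers):
        dilation = 2 ** layer  # assumed schedule: 1, 2, 4, ...
        filt_w = [[rng.uniform(-1, 1) for _ in range(kernel_size)] for _ in features]
        gate_w = [[rng.uniform(-1, 1) for _ in range(kernel_size)] for _ in features]
        features.append(gated_layer(features, filt_w, gate_w, dilation))
    return features
```

Feeding a single impulse through `dilated_conv1d` with `causal=False` and then with `causal=True` shows the difference between the two modes: only the non-causal output responds at frames that precede the impulse, which is what "seeing the future speech frames" amounts to.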

research
05/17/2022

Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation

Speech dereverberation is an important stage in many speech technology a...
research
12/06/2018

Frequency Tracking: LMS and RLS Applied to Speech Formant Estimation (2000)

Introduction Several speech processing algorithms assume the signal is s...
research
04/12/2021

Complex Spectral Mapping With Attention Based Convolution Recurrent Neural Network for Speech Enhancement

Speech enhancement has benefited from the success of deep learning in te...
research
08/26/2019

Deep Concept-wise Temporal Convolutional Networks for Action Localization

Existing action localization approaches adopt shallow temporal convoluti...
research
03/04/2018

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

For most deep learning practitioners, sequence modeling is synonymous wi...
research
08/24/2018

ParaNet - Using Dense Blocks for Early Inference

DenseNets have been shown to be a competitive model among recent convolu...
research
03/23/2018

What Do We Understand About Convolutional Networks?

This document will review the most prominent proposals using multilayer ...
