HGCN: harmonic gated compensation network for speech enhancement

01/30/2022
by   Tianrui Wang, et al.
0

Mask processing in the time-frequency (T-F) domain through the neural network has been one of the mainstreams for single-channel speech enhancement. However, it is hard for most models to handle the situation when harmonics are partially masked by noise. To tackle this challenge, we propose a harmonic gated compensation network (HGCN). We design a high-resolution harmonic integral spectrum to improve the accuracy of harmonic locations prediction. Then we add voice activity detection (VAD) and voiced region detection (VRD) to the convolutional recurrent network (CRN) to filter harmonic locations. Finally, the harmonic gating mechanism is used to guide the compensation model to adjust the coarse results from CRN to obtain the refinedly enhanced results. Our experiments show HGCN achieves substantial gain over a number of advanced approaches in the community.

READ FULL TEXT

page 2

page 3

page 4

research
06/01/2023

Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model

With fewer feature dimensions, filter banks are often used in light-weig...
research
11/12/2019

PHASEN: A Phase-and-Harmonics-Aware Speech Enhancement Network

Time-frequency (T-F) domain masking is a mainstream approach for single-...
research
06/08/2023

Convolutional Recurrent Neural Network with Attention for 3D Speech Enhancement

3D speech enhancement can effectively improve the auditory experience an...
research
11/06/2020

Robust ENF Estimation Based on Harmonic Enhancement and Maximum Weight Clique

We present a framework for robust electric network frequency (ENF) extra...
research
08/30/2022

HPPNet: Modeling the Harmonic Structure and Pitch Invariance in Piano Transcription

While neural network models are making significant progress in piano tra...
research
12/09/2021

Harmonic and non-Harmonic Based Noisy Reverberant Speech Enhancement in Time Domain

This paper introduces the single step time domain method named HnH-NRSE,...
research
05/21/2019

Bayesian Pitch Tracking Based on the Harmonic Model

Fundamental frequency is one of the most important characteristics of sp...

Please sign up or login with your details

Forgot password? Click here to reset