FFC-SE: Fast Fourier Convolution for Speech Enhancement

04/06/2022
by   Ivan Shchekotov, et al.
0

Fast Fourier convolution (FFC) is the recently proposed neural operator showing promising performance in several computer vision problems. The FFC operator allows employing large receptive field operations within early layers of the neural network. It was shown to be especially helpful for inpainting of periodic structures which are common in audio processing. In this work, we design neural network architectures which adapt FFC for speech enhancement. We hypothesize that a large receptive field allows these networks to produce more coherent phases than vanilla convolutional models, and validate this hypothesis experimentally. We found that neural networks based on Fast Fourier convolution outperform analogous convolutional models and show better or comparable results with other speech enhancement baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2023

PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement

Convolutional neural networks (CNN) and Transformer have wildly succeede...
research
10/28/2022

Speech Enhancement with Intelligent Neural Homomorphic Synthesis

Most neural network speech enhancement models ignore speech production m...
research
02/20/2020

Efficient Trainable Front-Ends for Neural Speech Enhancement

Many neural speech enhancement and source separation systems operate in ...
research
10/08/2022

Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs

Transformer models have made tremendous progress in various fields in re...
research
03/31/2022

Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain

Score-based generative models (SGMs) have recently shown impressive resu...
research
04/13/2022

Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation

Speech dereverberation is often an important requirement in robust speec...
research
07/27/2019

Dilated FCN: Listening Longer to Hear Better

Deep neural network solutions have emerged as a new and powerful paradig...

Please sign up or login with your details

Forgot password? Click here to reset