HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

03/24/2022
by   Pavel Andreev, et al.
0

Generative adversarial networks have recently demonstrated outstanding performance in neural vocoding outperforming best autoregressive and flow-based models. In this paper, we show that this success can be extended to other tasks of conditional audio generation. In particular, building upon HiFi vocoders, we propose a novel HiFi++ general framework for neural vocoding, bandwidth extension, and speech enhancement. We show that with the improved generator architecture and simplified multi-discriminator training, HiFi++ performs on par with the state-of-the-art in these tasks while spending significantly less memory and computational resources. The effectiveness of our approach is validated through a series of extensive experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2019

Perceptual Speech Enhancement via Generative Adversarial Networks

Automatic speech recognition (ASR) systems are of vital importance nowad...
research
11/04/2022

Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration

Diffusion-based generative models have had a high impact on the computer...
research
02/20/2020

iSEGAN: Improved Speech Enhancement Generative Adversarial Networks

Popular neural network-based speech enhancement systems operate on the m...
research
03/17/2023

Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture

This paper presents a configurable version of Extreme Bandwidth Extensio...
research
10/25/2022

EBEN: Extreme bandwidth extension network applied to speech signals captured with noise-resilient microphones

In this paper, we present Extreme Bandwidth Extension Network (EBEN), a ...
research
10/21/2022

Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation

Deep generative models for Speech Enhancement (SE) received increasing a...
research
02/24/2022

On the relevance of bandwidth extension for speaker identification

In this paper we discuss the relevance of bandwidth extension for speake...

Please sign up or login with your details

Forgot password? Click here to reset