PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech

01/31/2022
by   Srikanth Korse, et al.
0

The quality of speech coded by transform coding is affected by various artefacts especially when bitrates to quantize the frequency components become too low. In order to mitigate these coding artefacts and enhance the quality of coded speech, a post-processor that relies on a-priori information transmitted from the encoder is traditionally employed at the decoder side. In recent years, several data-driven post-postprocessors have been proposed which were shown to outperform traditional approaches. In this paper, we propose PostGAN, a GAN-based neural post-processor that operates in the sub-band domain and relies on the U-Net architecture and a learned affine transform. It has been tested on the recently standardized low-complexity, low-delay bluetooth codec (LC3) for wideband speech at the lowest bitrate (16 kbit/s). Subjective evaluations and objective scores show that the newly introduced post-processor surpasses previously published methods and can improve the quality of coded speech by around 20 MUSHRA points.

READ FULL TEXT
research
01/28/2022

A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain

Frequency domain processing, and in particular the use of Modified Discr...
research
07/25/2023

CQNV: A combination of coarsely quantized bitstream and neural vocoder for low rate speech coding

Recently, speech codecs based on neural networks have proven to perform ...
research
10/12/2020

Enhancement Of Coded Speech Using a Mask-Based Post-Filter

The quality of speech codecs deteriorates at low bitrates due to high qu...
research
06/25/2018

Convolutional Neural Networks to Enhance Coded Speech

Enhancing coded speech suffering from far-end acoustic background noise,...
research
08/09/2021

A Streamwise GAN Vocoder for Wideband Speech Coding at Very Low Bit Rate

Recently, GAN vocoders have seen rapid progress in speech synthesis, sta...
research
03/08/2022

Practical cognitive speech compression

This paper presents a new neural speech compression method that is pract...
research
08/24/2023

Hybrid noise shaping for audio coding using perfectly overlapped window

In recent years, audio coding technology has been standardized based on ...

Please sign up or login with your details

Forgot password? Click here to reset