A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain

01/28/2022
by   Kishan Gupta, et al.
0

Frequency domain processing, and in particular the use of Modified Discrete Cosine Transform (MDCT), is the most widespread approach to audio coding. However, at low bitrates, audio quality, especially for speech, degrades drastically due to the lack of available bits to directly code the transform coefficients. Traditionally, post-filtering has been used to mitigate artefacts in the coded speech by exploiting a-priori information of the source and extra transmitted parameters. Recently, data-driven post-filters have shown better results, but at the cost of significant additional complexity and delay. In this work, we propose a mask-based post-filter operating directly in MDCT domain of the codec, inducing no extra delay. The real-valued mask is applied to the quantized MDCT coefficients and is estimated from a relatively lightweight convolutional encoder-decoder network. Our solution is tested on the recently standardized low-delay, low-complexity codec (LC3) at lowest possible bitrate of 16 kbps. Objective and subjective assessments clearly show the advantage of this approach over the conventional post-filter, with an average improvement of 10 MUSHRA points over the LC3 coded speech.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/31/2022

PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech

The quality of speech coded by transform coding is affected by various a...
research
10/12/2020

Enhancement Of Coded Speech Using a Mask-Based Post-Filter

The quality of speech codecs deteriorates at low bitrates due to high qu...
research
08/24/2023

Hybrid noise shaping for audio coding using perfectly overlapped window

In recent years, audio coding technology has been standardized based on ...
research
06/25/2018

Convolutional Neural Networks to Enhance Coded Speech

Enhancing coded speech suffering from far-end acoustic background noise,...
research
08/24/2020

The economic value of additional airport departure capacity

This article presents a model for the economic value of extra capacity a...
research
08/25/2020

Transmitting Extra Bits by Rotating Signal Constellations

In this letter, we propose a novel LDPC coding scheme to transmit extra ...
research
04/12/2023

A Low-Complexity Post-Weighting Predistorter in a mMIMO Transmitter Under Crosstalk

In hybrid beamforming, the beam oriented digital predistortion (BO-DPD) ...

Please sign up or login with your details

Forgot password? Click here to reset