Hybrid noise shaping for audio coding using perfectly overlapped window

08/24/2023
by   Byeongho Jo, et al.

Recent audio coding standards build on several frameworks that incorporate linear predictive coding (LPC). However, coding transient signals with frequency-domain LP residual signals remains a challenge. Temporal noise shaping (TNS) can be applied to address this, but it does not operate effectively because the temporal envelope estimated in the modified discrete cosine transform (MDCT) domain is accompanied by time-domain aliasing (TDA) terms. In this study, we propose a coding framework based on the modulated complex lapped transform (MCLT) that integrates transform coded excitation (TCX) with complex LPC-based TNS (CTNS). Our approach uses a 50% overlap window and a switching scheme for the CTNS to improve coding efficiency. We also propose an adaptive calculation of the target bits for the sub-bands, using frequency envelope information derived from the quantized LPC coefficients. To minimize the quantization mismatch between the two modes, we propose an integrated quantization for real and complex values and a TDA augmentation method that compensates for the TDA components artificially generated during switching operations. The proposed coding framework shows superior performance in both objective metrics and subjective listening tests, demonstrating its suitability for low-bit-rate audio coding.
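The TDA issue mentioned in the abstract can be illustrated with a small, self-contained sketch. It is not the paper's implementation: the sine window, the phase offset `n0`, the sign of the imaginary part, the scaling, and the function names `sine_window`, `mclt`, and `inverse_frame` are all assumptions chosen for illustration, and the transform is computed by direct O(N^2) summation. Under these conventions, the real part of the modulated complex lapped transform is the MDCT of the windowed frame. Inverting only the real part of a single frame returns the windowed frame plus a mirrored aliasing term, while inverting the full complex spectrum returns the windowed frame without that term.

```python
import cmath
import math


def sine_window(n_half):
    """Sine window of length 2 * n_half for 50%-overlapped frames (assumed)."""
    return [math.sin(math.pi * (n + 0.5) / (2 * n_half))
            for n in range(2 * n_half)]


def mclt(frame, window):
    """Complex lapped transform of one frame of length 2N.

    Real part: MDCT of the windowed frame.
    Imaginary part: negated MDST of the windowed frame (sign is a convention).
    """
    n_half = len(frame) // 2
    n0 = 0.5 + n_half / 2.0  # usual MDCT phase offset
    coeffs = []
    for k in range(n_half):
        acc = 0j
        for n in range(2 * n_half):
            phase = math.pi / n_half * (n + n0) * (k + 0.5)
            acc += window[n] * frame[n] * cmath.exp(-1j * phase)
        coeffs.append(acc)
    return coeffs


def inverse_frame(coeffs, keep_imag):
    """Single-frame inverse, before synthesis windowing and overlap-add.

    keep_imag=True  : uses the full complex spectrum.
    keep_imag=False : uses only the real (MDCT) part, so the output
                      carries a time-domain aliasing term.
    """
    n_half = len(coeffs)
    n0 = 0.5 + n_half / 2.0
    scale = (1.0 if keep_imag else 2.0) / n_half
    out = []
    for n in range(2 * n_half):
        acc = 0.0
        for k in range(n_half):
            phase = math.pi / n_half * (n + n0) * (k + 0.5)
            c = coeffs[k] if keep_imag else complex(coeffs[k].real, 0.0)
            acc += (c * cmath.exp(1j * phase)).real
        out.append(scale * acc)
    return out


if __name__ == "__main__":
    N = 8
    frame = [0.0] * (2 * N)
    frame[2] = 1.0  # a single click, standing in for a transient
    w = sine_window(N)
    spectrum = mclt(frame, w)
    print("complex inverse:  ", [round(v, 3) for v in inverse_frame(spectrum, True)])
    print("MDCT-only inverse:", [round(v, 3) for v in inverse_frame(spectrum, False)])
```

In the MDCT-only output the click at index 2 is accompanied by a sign-flipped copy mirrored within the first half of the frame. In a conventional codec this term is cancelled by overlap-add with the neighbouring frame, but any temporal envelope estimated from the MDCT coefficients of a single frame still sees it.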


