High Quality Audio Coding with MDCTNet

12/08/2022
by   Grant Davidson, et al.
0

We propose a neural audio generative model, MDCTNet, operating in the perceptually weighted domain of an adaptive modified discrete cosine transform (MDCT). The architecture of the model captures correlations in both time and frequency directions with recurrent layers (RNNs). An audio coding system is obtained by training MDCTNet on a diverse set of fullband monophonic audio signals at 48 kHz sampling, conditioned by a perceptual audio encoder. In a subjective listening test with ten excerpts chosen to be balanced across content types, yet stressful for both codecs, the mean performance of the proposed system for 24 kb/s variable bitrate (VBR) is similar to that of Opus at twice the bitrate.

READ FULL TEXT
research
08/24/2023

Hybrid noise shaping for audio coding using perfectly overlapped window

In recent years, audio coding technology has been standardized based on ...
research
12/31/2020

Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding

Conventional audio coding technologies commonly leverage human perceptio...
research
06/17/2021

PixInWav: Residual Steganography for Hiding Pixels in Audio

Steganography comprises the mechanics of hiding data in a host media tha...
research
06/16/2021

WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution

Audio super-resolution is the task of constructing a high-resolution (HR...
research
06/04/2019

MelNet: A Generative Model for Audio in the Frequency Domain

Capturing high-level structure in audio waveforms is challenging because...
research
03/05/2020

Sparse and Cosparse Audio Dequantization Using Convex Optimization

The paper shows the potential of sparsity-based methods in restoring qua...
research
05/31/2023

DC CoMix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer

Despite the huge successes made in neutral TTS, content-leakage remains ...

Please sign up or login with your details

Forgot password? Click here to reset