Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding

11/04/2022
by   Haici Yang, et al.
0

Low and ultra-low-bitrate neural speech coding achieves unprecedented coding gain by generating speech signals from compact speech features. This paper introduces additional coding efficiency in neural speech coding by reducing the temporal redundancy existing in the frame-level feature sequence via a recurrent neural predictor. The prediction can achieve a low-entropy residual representation, which we discriminatively code based on their contribution to the signal reconstruction. The harmonization of feature prediction and discriminative coding results in a dynamic bit allocation algorithm that spends more bits on unpredictable but rare events. As a result, we develop a scalable, lightweight, low-latency, and low-bitrate neural speech coding system. We demonstrate the advantage of the proposed methods using the LPCNet as a neural vocoder. While the proposed method guarantees causality in its prediction, the subjective tests and feature space analysis show that our model achieves superior coding efficiency compared to LPCNet and Lyra V2 in the very low bitrates.

READ FULL TEXT
research
03/27/2021

Scalable and Efficient Neural Speech Coding

This work presents a scalable and efficient neural waveform codec (NWC) ...
research
04/05/2022

Non-Linear Speech coding with MLP, RBF and Elman based prediction

In this paper we propose a nonlinear scalar predictor based on a combina...
research
07/07/2022

NESC: Robust Neural End-2-End Speech Coding with GANs

Neural networks have proven to be a formidable tool to tackle the proble...
research
06/18/2019

Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding

Speech codecs learn compact representations of speech signals to facilit...
research
03/24/2022

A new subband non linear prediction coding algorithm for narrowband speech signal: The nADPCMB MLT coding scheme

This paper focuses on a newly developed transparent nADPCMB MLT speech c...
research
08/24/2023

Hybrid noise shaping for audio coding using perfectly overlapped window

In recent years, audio coding technology has been standardized based on ...
research
03/08/2022

Practical cognitive speech compression

This paper presents a new neural speech compression method that is pract...

Please sign up or login with your details

Forgot password? Click here to reset