Enhancement Of Coded Speech Using a Mask-Based Post-Filter

10/12/2020
by   Srikanth Korse, et al.
0

The quality of speech codecs deteriorates at low bitrates due to high quantization noise. A post-filter is generally employed to enhance the quality of the coded speech. In this paper, a data-driven post-filter relying on masking in the time-frequency domain is proposed. A fully connected neural network (FCNN), a convolutional encoder-decoder (CED) network and a long short-term memory (LSTM) network are implemeted to estimate a real-valued mask per time-frequency bin. The proposed models were tested on the five lowest operating modes (6.65 kbps-15.85 kbps) of the Adaptive Multi-Rate Wideband codec (AMR-WB). Both objective and subjective evaluations confirm the enhancement of the coded speech and also show the superiority of the mask-based neural network system over a conventional heuristic post-filter used in the standard like ITU-T G.718.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/28/2022

A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain

Frequency domain processing, and in particular the use of Modified Discr...
research
01/31/2022

PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech

The quality of speech coded by transform coding is affected by various a...
research
05/21/2019

DNN-Based Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement

Multi-frame approaches for single-microphone speech enhancement, e.g., t...
research
06/25/2018

Convolutional Neural Networks to Enhance Coded Speech

Enhancing coded speech suffering from far-end acoustic background noise,...
research
02/14/2022

Multi-Task Deep Residual Echo Suppression with Echo-aware Loss

This paper introduces the NWPU Team's entry to the ICASSP 2022 AEC Chall...
research
07/26/2018

Towards a Deep Unified Framework for Nuclear Reactor Perturbation Analysis

This paper proposes the first step towards a novel unified framework for...
research
07/24/2019

A neural network based post-filter for speech-driven head motion synthesis

Despite the fact that neural networks are widely used for speech-driven ...

Please sign up or login with your details

Forgot password? Click here to reset