Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model

05/11/2022
by   Jean-Marc Valin, et al.
0

As deep speech enhancement algorithms have recently demonstrated capabilities greatly surpassing their traditional counterparts for suppressing noise, reverberation and echo, attention is turning to the problem of packet loss concealment (PLC). PLC is a challenging task because it not only involves real-time speech synthesis, but also frequent transitions between the received audio and the synthesized concealment. We propose a hybrid neural PLC architecture where the missing speech is synthesized using a generative model conditioned using a predictive model. The resulting algorithm achieves natural concealment that surpasses the quality of existing conventional PLC algorithms and ranked second in the Interspeech 2022 PLC Challenge. We show that our solution not only works for uncompressed audio, but is also applicable to a modern speech codec.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2019

Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoder

Variational auto-encoders (VAEs) are deep generative latent variable mod...
research
02/26/2023

Contrast-PLC: Contrastive Learning for Packet Loss Concealment

Packet loss concealment (PLC) is challenging in concealing missing conte...
research
03/23/2023

A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI

Generative AI has demonstrated impressive performance in various fields,...
research
07/03/2022

Towards Error-Resilient Neural Speech Coding

Neural audio coding has shown very promising results recently in the lit...
research
11/08/2022

Improving performance of real-time full-band blind packet-loss concealment with predictive network

Packet loss concealment (PLC) is a tool for enhancing speech degradation...
research
07/04/2022

TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network

Real-time communications in packet-switched networks have become widely ...
research
09/03/2020

Detection of AI-Synthesized Speech Using Cepstral Bispectral Statistics

Digital technology has made possible unimaginable applications come true...

Please sign up or login with your details

Forgot password? Click here to reset