A Deep Learning Approach for Low-Latency Packet Loss Concealment of Audio Signals in Networked Music Performance Applications

07/14/2020
by Prateek Verma, et al.

Networked Music Performance (NMP) is envisioned as a potential game changer among Internet applications: it aims to revolutionize the traditional concept of musical interaction by enabling remote musicians to interact and perform together through a telecommunication network. Ensuring realistic conditions for music performance, however, constitutes a significant engineering challenge due to extremely strict requirements in terms of audio quality and, most importantly, network delay. To minimize the end-to-end delay experienced by the musicians, typical implementations of NMP applications use uncompressed, bidirectional audio streams and UDP as the transport protocol. Because UDP is connectionless and unreliable, audio packets that are lost in transit are not retransmitted and thus cause glitches in the receiver's audio playout. This article describes a technique for predicting lost packet content in real time using a deep learning approach. The ability to conceal errors in real time can help mitigate audio impairments caused by packet losses, thus improving the quality of audio playout in real-world scenarios.

research
11/08/2022

Improving performance of real-time full-band blind packet-loss concealment with predictive network

Packet loss concealment (PLC) is a tool for enhancing speech degradation...
research
04/11/2022

INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge

Audio Packet Loss Concealment (PLC) is the hiding of gaps in audio strea...
research
08/29/2018

Enabling Ultra-Low Delay Teleorchestras using Software Defined Networking

Ultra-low delay sensitive applications can afford delay only at the leve...
research
05/15/2020

ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition

Packet loss is a common problem in data transmission, including speech d...
research
10/25/2020

Enactive Mandala: Audio-visualizing Brain Waves

We are exploring the design and implementation of artificial expressions...
research
03/22/2023

Dynamic Reliability: Reliably Sending Unreliable Data

5G and Beyond networks promise low-latency support for applications that...
research
08/08/2021

Audio Spectral Enhancement: Leveraging Autoencoders for Low Latency Reconstruction of Long, Lossy Audio Sequences

With active research in audio compression techniques yielding substantia...
