DeepAI AI Chat
Log In Sign Up

A Deep Learning Approach for Low-Latency Packet Loss Concealment of Audio Signals in Networked Music Performance Applications

by   Prateek Verma, et al.
Politecnico di Torino
Stanford University
Politecnico di Milano

Networked Music Performance (NMP) is envisioned as a potential game changer among Internet applications: it aims at revolutionizing the traditional concept of musical interaction by enabling remote musicians to interact and perform together through a telecommunication network. Ensuring realistic conditions for music performance, however, constitutes a significant engineering challenge due to extremely strict requirements in terms of audio quality and, most importantly, network delay. To minimize the end-to-end delay experienced by the musicians, typical implementations of NMP applications use un-compressed, bidirectional audio streams and leverage UDP as transport protocol. Being connection less and unreliable,audio packets transmitted via UDP which become lost in transit are not re-transmitted and thus cause glitches in the receiver audio playout. This article describes a technique for predicting lost packet content in real-time using a deep learning approach. The ability of concealing errors in real time can help mitigate audio impairments caused by packet losses, thus improving the quality of audio playout in real-world scenarios.


Improving performance of real-time full-band blind packet-loss concealment with predictive network

Packet loss concealment (PLC) is a tool for enhancing speech degradation...

INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge

Audio Packet Loss Concealment (PLC) is the hiding of gaps in audio strea...

Enabling Ultra-Low Delay Teleorchestras using Software Defined Networking

Ultra-low delay sensitive applications can afford delay only at the leve...

ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition

Packet loss is a common problem in data transmission, including speech d...

Enactive Mandala: Audio-visualizing Brain Waves

We are exploring the design and implementation of artificial expressions...

libACA, pyACA, and ACA-Code: Audio Content Analysis in 3 Languages

The three packages libACA, pyACA, and ACA-Code provide reference impleme...

MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing

Deep learning methods have brought substantial advancements in speech se...