Mono-to-stereo through parametric stereo generation

06/26/2023
by Joan Serrà, et al.

Generating a stereophonic presentation from a monophonic audio signal is a challenging open task, especially if the goal is to obtain realistic spatial imaging with a specific panning of sound elements. In this work, we propose to convert mono to stereo by predicting parametric stereo (PS) parameters using both nearest neighbor and deep network approaches. In combination with PS, we also propose to model the task with generative approaches, which allows synthesizing multiple, equally plausible stereo renditions from the same mono signal. To achieve this, we consider both autoregressive and masked token modelling approaches. We provide evidence that the proposed PS-based models outperform a competitive classical decorrelation baseline and that, within a PS prediction framework, modern generative models outperform equivalent non-generative counterparts. Overall, our work positions both PS and generative modelling as strong and appealing methodologies for mono-to-stereo upmixing. We also discuss the limitations of these approaches.
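To make the idea of PS parameters more concrete, the following is a minimal sketch and not the method described in the paper. It assumes two standard PS quantities, the inter-channel intensity difference (IID, in dB) and the inter-channel coherence (IC), computed over a single broadband frame, whereas practical PS coding works per time frame and per frequency band. The function names `ps_parameters` and `upmix_with_iid` are hypothetical, and the upmix shown here only applies an IID to the mono signal, so it always yields fully coherent channels (IC close to 1); reducing coherence would additionally require a decorrelator.

```python
import numpy as np


def ps_parameters(left, right, eps=1e-12):
    """Broadband IID (dB) and IC for one frame of a stereo signal.

    Assumption: a single band and a single frame, for illustration only.
    """
    e_left = np.sum(left ** 2)
    e_right = np.sum(right ** 2)
    # Intensity difference: ratio of channel energies in dB.
    iid_db = 10.0 * np.log10((e_left + eps) / (e_right + eps))
    # Coherence: normalized zero-lag cross-correlation.
    ic = np.sum(left * right) / np.sqrt((e_left + eps) * (e_right + eps))
    return iid_db, ic


def upmix_with_iid(mono, iid_db):
    """Pan a mono frame so that the result has the requested IID.

    Assumption: mono is the downmix (left + right) / 2, so the gains
    are chosen such that averaging the outputs returns the input.
    """
    c = 10.0 ** (iid_db / 20.0)  # amplitude ratio left / right
    left = (2.0 * c / (1.0 + c)) * mono
    right = (2.0 / (1.0 + c)) * mono
    return left, right


# Usage: pan a noise frame 6 dB towards the left and re-estimate the parameters.
rng = np.random.default_rng(0)
mono = rng.standard_normal(1024)
left, right = upmix_with_iid(mono, 6.0)
iid_db, ic = ps_parameters(left, right)
```

In a prediction setting such as the one the abstract describes, a model would supply the parameter values (here, the hand-picked 6 dB) from the mono signal alone.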


research
06/10/2020

Deep generative models for musical audio synthesis

Sound modelling is the process of developing algorithms that generate so...
research
04/20/2021

Identification of fake stereo audio

Channel is one of the important criterions for digital audio quality. Ge...
research
10/02/2018

Semi-dense Stereo Matching using Dual CNNs

A robust solution for semi-dense stereo matching is presented. It utiliz...
research
07/20/2020

Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation

Stereophonic audio is an indispensable ingredient to enhance human audit...
research
02/10/2020

Uncertainty Estimation for End-To-End Learned Dense Stereo Matching via Probabilistic Deep Learning

Motivated by the need to identify erroneous disparity assignments, vario...
research
08/20/2020

Not My Deepfake: Towards Plausible Deniability for Machine-Generated Media

Progress in generative modelling, especially generative adversarial netw...
