Geometrically-Motivated Primary-Ambient Decomposition With Center-Channel Extraction

06/05/2022
by   Jouni Paulus, et al.
0

A geometrically-motivated method for primary-ambient decomposition is proposed and evaluated in an up-mixing application. The method consists of two steps, accommodating a particularly intuitive explanation. The first step consists of signal-adaptive rotations applied on the input stereo scene, which translate the primary sound sources into the center of the rotated scene. The second step applies a center-channel extraction method, based on a simple signal model and optimal in the mean-squared-error sense. The performance is evaluated by using the estimated ambient component to enable surround sound starting from real-world stereo signals. The participants in the reported listening test are asked to adjust the audio scene envelopment and find the audio settings that pleases them the most. The possibility for up-mixing enabled by the proposed method is used extensively, and the user satisfaction is significantly increased compared to the original stereo mix.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/10/2021

Structure from Silence: Learning Scene Structure from Ambient Sound

From whirling ceiling fans to ticking clocks, the sounds that we hear su...
research
03/01/2023

Event Fusion Photometric Stereo Network

We introduce a novel method to estimate surface normal of an object in a...
research
05/11/2020

Foreground-Background Ambient Sound Scene Separation

Ambient sound scenes typically comprise multiple short events occurring ...
research
04/20/2021

Identification of fake stereo audio

Channel is one of the important criterions for digital audio quality. Ge...
research
09/10/2020

Speaker Diarization Using Stereo Audio Channels: Preliminary Study on Utterance Clustering

Speaker diarization is one of the actively researched topics in audio si...
research
11/25/2022

Stereo Speech Enhancement Using Custom Mid-Side Signals and Monaural Processing

Speech Enhancement (SE) systems typically operate on monaural input and ...
research
11/30/2022

Extreme Audio Time Stretching Using Neural Synthesis

A deep neural network solution for time-scale modification (TSM) focused...

Please sign up or login with your details

Forgot password? Click here to reset