Deep Tensor Factorization for Spatially-Aware Scene Decomposition

05/03/2019
by   Jonah Casebeer, et al.
0

We propose a completely unsupervised method to understand audio scenes observed with random microphone arrangements by decomposing the scene into its constituent sources and their relative presence in each microphone. To this end, we formulate a neural network architecture that can be interpreted as a nonnegative tensor factorization of a multi-channel audio recording. By clustering on the learned network parameters corresponding to channel content, we can learn sources' individual spectral dictionaries and their activation patterns over time. Our method allows us to leverage deep learning advances like end-to-end training, while also allowing stochastic minibatch training so that we can feasibly decompose realistic audio scenes that are intractable to decompose using standard methods. This neural network architecture is easily extensible to other kinds of tensor factorizations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2019

Nonnegative Canonical Polyadic Decomposition with Rank Deficient Factors

Recently, there is an emerging interest for applications of tensor facto...
research
07/05/2019

Deep Neural Baselines for Computational Paralinguistics

Detecting sleepiness from spoken language is an ambitious task, which is...
research
02/14/2022

Fast algorithm for overcomplete order-3 tensor decomposition

We develop the first fast spectral algorithm to decompose a random third...
research
09/01/2021

Prior Distribution Design for Music Bleeding-Sound Reduction Based on Nonnegative Matrix Factorization

When we place microphones close to a sound source near other sources in ...
research
11/27/2022

GRelPose: Generalizable End-to-End Relative Camera Pose Regression

This paper proposes a generalizable, end-to-end deep learning-based meth...
research
12/05/2022

D-TensoRF: Tensorial Radiance Fields for Dynamic Scenes

Neural radiance field (NeRF) attracts attention as a promising approach ...
research
04/17/2021

Uncovering audio patterns in music with Nonnegative Tucker Decomposition for structural segmentation

Recent work has proposed the use of tensor decomposition to model repeti...

Please sign up or login with your details

Forgot password? Click here to reset