Foreground-Background Ambient Sound Scene Separation

05/11/2020
by Michel Olvera, et al.

Ambient sound scenes typically comprise multiple short events occurring on top of a somewhat stationary background. We consider the task of separating these events from the background, which we call foreground-background ambient sound scene separation. We propose a deep learning-based separation framework with a suitable feature normalization scheme and an optional auxiliary network capturing the background statistics, and we investigate its ability to handle the great variety of sound classes encountered in ambient sound scenes, which have often not been seen in training. To do so, we create single-channel foreground-background mixtures using isolated sounds from the DESED and Audioset datasets, and we conduct extensive experiments with mixtures of seen or unseen sound classes at various signal-to-noise ratios. Our experimental findings demonstrate the generalization ability of the proposed approach.
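
The abstract mentions building single-channel foreground-background mixtures at various signal-to-noise ratios. As a rough illustration only, the sketch below shows one common way to mix two signals at a target SNR: rescale the foreground so that its power relative to the background matches the requested value in dB, then add the two. The function names (`power`, `mix_at_snr`), the synthetic tone and noise, and the choice to measure power over the whole clip are assumptions made for this example. The abstract does not specify how the authors define or compute SNR, and this is not their data pipeline.

```python
import math
import random


def power(signal):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(foreground, background, snr_db):
    """Scale the foreground to a target foreground-to-background SNR, then add.

    Assumption: SNR is computed over the full clip length, with the
    foreground treated as the "signal" and the background as the "noise".
    Returns the mixture and the scaled foreground (the separation target).
    """
    if len(foreground) != len(background):
        raise ValueError("signals must have the same length")
    p_fg = power(foreground)
    p_bg = power(background)
    if p_fg == 0 or p_bg == 0:
        raise ValueError("both signals must have non-zero power")
    # Choose gain g so that 10*log10(g^2 * p_fg / p_bg) == snr_db.
    gain = math.sqrt(p_bg / p_fg * 10 ** (snr_db / 10))
    scaled_fg = [gain * x for x in foreground]
    mixture = [f + b for f, b in zip(scaled_fg, background)]
    return mixture, scaled_fg


# Synthetic stand-ins (hypothetical): stationary noise as the background,
# a short 440 Hz burst as the foreground event, both at 16 kHz.
rng = random.Random(0)
sample_rate = 16000
n = sample_rate
background = [rng.gauss(0.0, 0.1) for _ in range(n)]
foreground = [
    math.sin(2 * math.pi * 440 * t / sample_rate) if 4000 <= t < 8000 else 0.0
    for t in range(n)
]

mixture, scaled_fg = mix_at_snr(foreground, background, snr_db=6.0)
achieved_snr_db = 10 * math.log10(power(scaled_fg) / power(background))
```

In a training setup of this kind, `mixture` would be the network input and `scaled_fg` and `background` the reference signals. Measuring power only over the active region of the event instead of the whole clip would give a different gain for the same nominal SNR.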

research
11/10/2021

Structure from Silence: Learning Scene Structure from Ambient Sound

From whirling ceiling fans to ticking clocks, the sounds that we hear su...
research
11/16/2019

VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation

The performance of sound event detection methods can significantly degra...
research
03/23/2021

GISE-51: A scalable isolated sound events dataset

Most of the existing isolated sound event datasets comprise a small numb...
research
06/05/2022

Geometrically-Motivated Primary-Ambient Decomposition With Center-Channel Extraction

A geometrically-motivated method for primary-ambient decomposition is pr...
research
05/05/2021

Self-Supervised Learning from Automatically Separated Sound Scenes

Real-world sound scenes consist of time-varying collections of sound sou...
research
07/02/2019

WHAM!: Extending Speech Separation to Noisy Environments

Recent progress in separating the speech signals from multiple overlappi...
research
12/20/2017

Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning

The sound of crashing waves, the roar of fast-moving cars -- sound conve...
