Listen to What You Want: Neural Network-based Universal Sound Selector

06/10/2020
by Tsubasa Ochiai, et al.

Being able to control which acoustic events (AEs) we listen to would allow the development of more controllable hearable devices. This paper addresses the AE sound selection (or removal) problem, which we define as the extraction (or suppression) of all the sounds that belong to one or multiple desired AE classes. Although this problem could be addressed by source separation followed by AE classification, such a cascade is sub-optimal. Moreover, source separation usually requires knowing the maximum number of sources, which may not be practical when dealing with AEs. In this paper, we instead propose a universal sound selection neural network that directly selects AE sounds from a mixture given user-specified target AE classes. The proposed framework can be explicitly optimized to simultaneously select sounds from multiple desired AE classes, independently of the number of sources in the mixture. We experimentally show that the proposed method achieves promising AE sound selection performance and can generalize to mixtures containing numbers of sources unseen during training.
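The problem definition in the abstract can be illustrated with a small sketch. It only shows how a selection (or removal) target could be built from labelled sources and how the desired classes could be encoded as a multi-hot vector; it is not the authors' network or training code. The function names, class names, and toy signals are all assumptions made for illustration.

```python
# Illustrative sketch of the problem definition only, not the paper's model.
# All names (multi_hot, selection_target, mix) and the toy data are hypothetical.

def multi_hot(target_classes, all_classes):
    """Encode the user-specified target AE classes as a multi-hot vector."""
    return [1.0 if c in target_classes else 0.0 for c in all_classes]

def selection_target(sources, labels, target_classes):
    """Sum, sample by sample, every source whose class is a target class."""
    out = [0.0] * len(sources[0])
    for src, lab in zip(sources, labels):
        if lab in target_classes:
            out = [o + s for o, s in zip(out, src)]
    return out

def mix(sources):
    """The observed mixture is the sample-wise sum of all sources."""
    return [sum(samples) for samples in zip(*sources)]

all_classes = ["dog", "siren", "speech"]
# Four toy sources of three samples each; two of them share the class "dog".
sources = [[1.0, 0.0, 2.0], [0.5, 0.5, 0.5], [0.0, 1.0, 0.0], [2.0, 0.0, 0.0]]
labels = ["dog", "siren", "speech", "dog"]
targets = {"dog", "speech"}

mixture = mix(sources)
class_vector = multi_hot(targets, all_classes)
selected = selection_target(sources, labels, targets)   # selection target
removed = [m - s for m, s in zip(mixture, selected)]    # removal target
```

Note that the target depends only on which classes are requested, not on how many sources are in the mixture: both "dog" sources end up in `selected` without the number of sources being specified anywhere.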

research
02/06/2020

Source separation with weakly labelled data: An approach to computational auditory scene analysis

Source separation is the task to separate an audio recording into indivi...
research
06/14/2021

Few-shot learning of new sound classes for target sound extraction

Target sound extraction consists of extracting the sound of a target aco...
research
05/19/2023

Direction Specific Ambisonics Source Separation with End-To-End Deep Learning

Ambisonics is a scene-based spatial audio format that has several useful...
research
11/18/2019

Improving Universal Sound Separation Using Sound Classification

Deep learning approaches have recently achieved impressive performance o...
research
10/04/2019

Modeling the Comb Filter Effect and Interaural Coherence for Binaural Source Separation

Typical methods for binaural source separation consider only the direct ...
research
08/06/2021

RadioMic: Sound Sensing via mmWave Signals

Voice interfaces have become an integral part of our lives, with the prol...
research
04/08/2022

SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning

In many situations, we would like to hear desired sound events (SEs) whi...
