Listening for Sirens: Locating and Classifying Acoustic Alarms in City Scenes

10/11/2018
by   Letizia Marchegiani, et al.
0

This paper is about alerting acoustic event detection and sound source localisation in an urban scenario. Specifically, we are interested in spotting the presence of horns, and sirens of emergency vehicles. In order to obtain a reliable system able to operate robustly despite the presence of traffic noise, which can be copious, unstructured and unpredictable, we propose to treat the spectrograms of incoming stereo signals as images, and apply semantic segmentation, based on a Unet architecture, to extract the target sound from the background noise. In a multi-task learning scheme, together with signal denoising, we perform acoustic event classification to identify the nature of the alerting sound. Lastly, we use the denoised signals to localise the acoustic source on the horizon plane, by regressing the direction of arrival of the sound through a CNN architecture. Our experimental evaluation shows an average classification rate of 94 localisation of 7.5 when operating on audio frames of 0.5s, and of 2.5 when operating on frames of 2.5s. The system offers excellent performance in particularly challenging scenarios, where the noise level is remarkably high.

READ FULL TEXT

page 1

page 4

research
04/11/2019

Cross-task learning for audio tagging, sound event detection spatial localization: DCASE 2019 baseline systems

The Detection and Classification of Acoustic Scenes and Events (DCASE) 2...
research
04/06/2019

Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems

The Detection and Classification of Acoustic Scenes and Events (DCASE) 2...
research
10/27/2022

One-Shot Acoustic Matching Of Audio Signals – Learning to Hear Music In Any Room/ Concert Hall

The acoustic space in which a sound is created and heard plays an essent...
research
12/19/2021

Detect what you want: Target Sound Detection

Human beings can perceive a target sound that we are interested in from ...
research
07/12/2016

City-Identification of Flickr Videos Using Semantic Acoustic Features

City-identification of videos aims to determine the likelihood of a vide...
research
08/15/2021

ALTo: Ad Hoc High-Accuracy Touch Interaction Using Acoustic Localization

Millions of people around the world face motor impairments due to Parkin...
research
08/05/2019

Presenting the Acoustic Sounds for Wellbeing Dataset and Baseline Classification Results

The field of sound healing includes ancient practices coming from a broa...

Please sign up or login with your details

Forgot password? Click here to reset