City classification from multiple real-world sound scenes

05/02/2019
by   Helen L Bear, et al.
0

The majority of sound scene analysis work focuses on one of two clearly defined tasks: acoustic scene classification or sound event detection. Whilst this separation of tasks is useful for problem definition, they inherently ignore some subtleties of the real-world, in particular how humans vary in how they describe a scene. Some will describe the weather and features within it, others will use a holistic descriptor like 'park', and others still will use unique identifiers such as cities or names. In this paper, we undertake the task of automatic city classification to ask whether we can recognize a city from a set of sound scenes? In this problem each each city has recordings from multiple scenes. We test a series of methods for this novel task and show that whilst a simple convolutional neural network (CNN) can achieve accuracy of 50 which is less than the acoustic scene classification task baseline in the DCASE 2018 ASC challenge (on the same data), with a simple adaptation to the class labels to use paired city labels with grouped scenes, accuracy increases to 52 the problem in a multi-task learning framework and achieve an accuracy of 56 outperforming the aforementioned approaches.

READ FULL TEXT
research
04/23/2019

Towards joint sound scene and polyphonic sound event recognition

Acoustic Scene Classification (ASC) and Sound Event Detection (SED) are ...
research
02/28/2023

Incremental Learning of Acoustic Scenes and Sound Events

In this paper, we propose a method for incremental learning of two disti...
research
07/12/2016

City-Identification of Flickr Videos Using Semantic Acoustic Features

City-identification of videos aims to determine the likelihood of a vide...
research
03/30/2021

Environmental sound analysis with mixup based multitask learning and cross-task fusion

Environmental sound analysis is currently getting more and more attentio...
research
09/26/2018

An extensible cluster-graph taxonomy for open set sound scene analysis

We present a new extensible and divisible taxonomy for open set sound sc...
research
07/09/2020

Low Cost Gunshot Detection using Deep Learning on the Raspberry Pi

Many cities using gunshot detection technology depend on expensive syste...
research
06/27/2022

Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework

Acoustic events are sounds with well-defined spectro-temporal characteri...

Please sign up or login with your details

Forgot password? Click here to reset