How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks

04/05/2022
by   Keisuke Imoto, et al.
0

Acoustic scene classification (ASC) and sound event detection (SED) are fundamental tasks in environmental sound analysis, and many methods based on deep learning have been proposed. Considering that information on acoustic scenes and sound events helps SED and ASC mutually, some researchers have proposed a joint analysis of acoustic scenes and sound events by multitask learning (MTL). However, conventional works have not investigated in detail how acoustic scenes and sound events mutually benefit SED and ASC. We, therefore, investigate the impact of information on acoustic scenes and sound events on the performance of SED and ASC by using domain adversarial training based on a gradient reversal layer (GRL) or model training with fake labels. Experimental results obtained using the TUT Acoustic Scenes 2016/2017 and TUT Sound Events 2016/2017 show that pieces of information on acoustic scenes and sound events are effectively used to detect sound events and classify acoustic scenes, respectively. Moreover, upon comparing GRL- and fake-label-based methods with single-task-based ASC and SED methods, single-task-based methods are found to achieve better performance. This result implies that even when using single-task-based ASC and SED methods, information on acoustic scenes may be implicitly utilized for SED and vice versa.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2020

Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels

Sound event detection (SED) and acoustic scene classification (ASC) are ...
research
03/30/2021

Environmental sound analysis with mixup based multitask learning and cross-task fusion

Environmental sound analysis is currently getting more and more attentio...
research
06/27/2022

Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework

Acoustic events are sounds with well-defined spectro-temporal characteri...
research
04/10/2019

A Compact and Discriminative Feature Based on Auditory Summary Statistics for Acoustic Scene Classification

One of the biggest challenges of acoustic scene classification (ASC) is ...
research
04/05/2019

Deep Learning Features for Robust Detection of Acoustic Events in Sleep-Disordered Breathing

Sleep-disordered breathing (SDB) is a serious and prevalent condition, a...
research
11/26/2018

Scene Recognition Through Visual and Acoustic Cues Using K-Means

We propose a K-Means based prediction system, nicknamed SERVANT (Scene R...
research
10/14/2022

Description and analysis of novelties introduced in DCASE Task 4 2022 on the baseline system

The aim of the Detection and Classification of Acoustic Scenes and Event...

Please sign up or login with your details

Forgot password? Click here to reset