DCASE 2018 Challenge baseline with convolutional neural networks

08/02/2018
by   Qiuqiang Kong, et al.
0

The Detection and Classification of Acoustic Scenes and Events (DCASE) is a well-known IEEE AASP challenge consisting of a number of audio classification and sound event detection tasks. DCASE 2018 has five tasks: 1) Acoustic scene classification, 2) General-purpose audio tagging, 3) Bird audio detection, 4) Weakly-labeled semi-supervised sound event detection and 5) Multi-channel audio tagging. In this paper, we describe our Python implementation of all of these tasks, which we have released under an open-source license. The baseline source code contains the implementation of convolutional neural networks (CNNs), including AlexNetish and VGGish -- networks originating from computer vision. We researched how the performance varies from task to task with the same configuration of neural networks. Experiments show that the deeper VGGish network performs better than AlexNetish on all tasks except Task 1, where VGGish and AlexNetish perform similarly. With the VGGish network, we achieve an accuracy of 0.680 on Task 1, a mean average precision (mAP) of 0.928 on Task 2, an area under the curve (AUC) of 0.854 on Task 3, a sound event detection F1 score of 20.8% on Task 4 and an F1 score of 87.75% on Task 5.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2018

DCASE 2018 Challenge Surrey Cross-Task convolutional neural network baseline

The Detection and Classification of Acoustic Scenes and Events (DCASE) c...
research
08/10/2017

DNN and CNN with Weighted and Multi-task Loss Functions for Audio Event Detection

This report presents our audio event detection system submitted for Task...
research
05/30/2023

E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural Networks

Sounds carry an abundance of information about activities and events in ...
research
03/25/2022

AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification

After its sweeping success in vision and language tasks, pure attention-...
research
02/28/2023

Incremental Learning of Acoustic Scenes and Sound Events

In this paper, we propose a method for incremental learning of two disti...
research
04/27/2021

DASEE A Synthetic Database of Domestic Acoustic Scenes and Events in Dementia Patients Environment

Access to informative databases is a crucial part of notable research de...
research
10/14/2022

Description and analysis of novelties introduced in DCASE Task 4 2022 on the baseline system

The aim of the Detection and Classification of Acoustic Scenes and Event...

Please sign up or login with your details

Forgot password? Click here to reset