GISE-51: A scalable isolated sound events dataset

03/23/2021
by Sarthak Yadav, et al.

Most existing isolated sound event datasets cover only a small number of sound event classes, usually 10 to 15, and are restricted to a narrow domain such as domestic or urban sound events. In this work, we introduce GISE-51, a dataset spanning 51 isolated sound events drawn from a broad domain of event types. We also release GISE-51-Mixtures, a dataset of 5-second soundscapes with hard-labelled event boundaries, synthesized from the GISE-51 isolated sound events. We conduct baseline sound event recognition (SER) experiments on GISE-51-Mixtures, benchmarking prominent convolutional neural networks. Models trained on the dataset demonstrate strong transfer learning performance on existing audio recognition benchmarks. Together, GISE-51 and GISE-51-Mixtures attempt to address some of the shortcomings of recent sound event datasets: they provide an open, reproducible benchmark for future research, along with the freedom to adapt the included isolated sound events for domain-specific applications.
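To make the idea of a synthesized soundscape with hard-labelled event boundaries concrete, the following is a minimal sketch, not the authors' pipeline. It assumes each isolated event is available as a 1-D NumPy waveform, and the function name `synthesize_soundscape`, the 22050 Hz sample rate, the random-onset placement, and the peak normalization are all illustrative assumptions; only the 5-second clip length comes from the abstract.

```python
import numpy as np

SAMPLE_RATE = 22050   # assumed for illustration; not stated in the abstract
CLIP_SECONDS = 5.0    # soundscape length given in the abstract


def synthesize_soundscape(events, rng, sample_rate=SAMPLE_RATE,
                          clip_seconds=CLIP_SECONDS):
    """Mix isolated events into one clip and record their boundaries.

    `events` is a list of (label, waveform) pairs, where each waveform is
    a 1-D float array. Returns (audio, annotations); each annotation is a
    hard label of the form (label, onset_seconds, offset_seconds).
    """
    n_total = int(round(sample_rate * clip_seconds))
    mix = np.zeros(n_total, dtype=np.float32)
    annotations = []
    for label, wave in events:
        # Truncate any event that is longer than the whole clip.
        wave = np.asarray(wave, dtype=np.float32)[:n_total]
        if wave.size == 0:
            continue
        # Pick a random onset such that the event fits inside the clip.
        start = int(rng.integers(0, n_total - wave.size + 1))
        mix[start:start + wave.size] += wave
        annotations.append(
            (label, start / sample_rate, (start + wave.size) / sample_rate)
        )
    # Rescale only if overlapping events pushed the mix past full scale.
    peak = float(np.max(np.abs(mix)))
    if peak > 1.0:
        mix /= peak
    return mix, annotations


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two synthetic stand-ins for isolated events (hypothetical labels).
    demo_events = [
        ("dog", 0.5 * np.ones(1000, dtype=np.float32)),
        ("bell", 0.5 * np.ones(2000, dtype=np.float32)),
    ]
    audio, labels = synthesize_soundscape(demo_events, rng)
    for label, onset, offset in labels:
        print(f"{label}: {onset:.3f}s to {offset:.3f}s")
```

Because the onsets and offsets are recorded at mixing time, the labels are exact by construction, which is what distinguishes hard-labelled synthetic mixtures from weakly labelled recordings.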


research
02/21/2019

The NIGENS General Sound Events Database

Computational auditory scene analysis is gaining interest in the last ye...
research
05/06/2021

USM-SED - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios

This paper introduces a novel dataset for polyphonic sound event detecti...
research
10/01/2020

FSD50K: an Open Dataset of Human-Labeled Sound Events

Most existing datasets for sound event recognition (SER) are relatively ...
research
02/26/2020

An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments

The problem of training a deep neural network with a small set of positi...
research
05/11/2020

Foreground-Background Ambient Sound Scene Separation

Ambient sound scenes typically comprise multiple short events occurring ...
research
10/27/2019

Sound Event Recognition in a Smart City Surveillance Context

Due to the growing demand for improving surveillance capabilities in sma...
research
03/04/2022

Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection

In recent years, exploring effective sound separation (SSep) techniques ...
