Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers

07/28/2021
by   Piper Wolters, et al.
2

There are many important applications for detecting and localizing specific sound events within long, untrimmed documents including keyword spotting, medical observation, and bioacoustic monitoring for conservation. Deep learning techniques often set the state-of-the-art for these tasks. However, for some types of events, there is insufficient labeled data to train deep learning models. In this paper, we propose novel approaches to few-shot sound event detection utilizing region proposals and the Perceiver architecture, which is capable of accurately localizing sound events with very few examples of each class of interest. Motivated by a lack of suitable benchmark datasets for few-shot audio event detection, we generate and evaluate on two novel episodic rare sound event datasets: one using clips of celebrity speech as the sound event, and the other using environmental sounds. Our highest performing proposed few-shot approaches achieve 0.575 and 0.672 F1-score, respectively, with 5-shot 5-way tasks on these two datasets. These represent absolute improvements of 0.200 and 0.234 over strong proposal-free few-shot sound event detection baselines.

READ FULL TEXT

page 1

page 6

research
07/14/2022

Few-shot bioacoustic event detection at the DCASE 2022 challenge

Few-shot sound event detection is the task of detecting sound events, de...
research
10/07/2021

Peer Collaborative Learning for Polyphonic Sound Event Detection

This paper describes that semi-supervised learning called peer collabora...
research
06/15/2023

Few-shot bioacoustic event detection at the DCASE 2023 challenge

Few-shot bioacoustic event detection consists in detecting sound events ...
research
06/18/2023

Channel-Spatial-Based Few-Shot Bird Sound Event Detection

In this paper, we propose a model for bird sound event detection that fo...
research
05/24/2022

Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection

Sound event detection is to infer the event by understanding the surroun...
research
10/09/2021

A Mutual learning framework for Few-shot Sound Event Detection

Although prototypical network (ProtoNet) has proved to be an effective m...
research
05/16/2023

MsPrompt: Multi-step Prompt Learning for Debiasing Few-shot Event Detection

Event detection (ED) is aimed to identify the key trigger words in unstr...

Please sign up or login with your details

Forgot password? Click here to reset