Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning

09/02/2023
by   Ilyass Moummad, et al.
0

Deep learning has been widely used recently for sound event detection and classification. Its success is linked to the availability of sufficiently large datasets, possibly with corresponding annotations when supervised learning is considered. In bioacoustic applications, most tasks come with few labelled training data, because annotating long recordings is time consuming and costly. Therefore supervised learning is not the best suited approach to solve bioacoustic tasks. The bioacoustic community recasted the problem of sound event detection within the framework of few-shot learning, i.e. training a system with only few labeled examples. The few-shot bioacoustic sound event detection task in the DCASE challenge focuses on detecting events in long audio recordings given only five annotated examples for each class of interest. In this paper, we show that learning a rich feature extractor from scratch can be achieved by leveraging data augmentation using a supervised contrastive learning framework. We highlight the ability of this framework to transfer well for five-shot event detection on previously unseen classes in the training data. We obtain an F-score of 63.46% on the validation set and 42.7% on the test set, ranking second in the DCASE challenge. We provide an ablation study for the critical choices of data augmentation techniques as well as for the learning strategy applied on the training set.

READ FULL TEXT
research
09/16/2023

Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection

Bioacoustic sound event detection allows for better understanding of ani...
research
05/22/2023

Learning to detect an animal sound from five examples

Automatic detection and classification of animal sounds has many applica...
research
10/09/2021

A Mutual learning framework for Few-shot Sound Event Detection

Although prototypical network (ProtoNet) has proved to be an effective m...
research
08/19/2023

Robust Fraud Detection via Supervised Contrastive Learning

Deep learning models have recently become popular for detecting maliciou...
research
09/22/2021

Few-Shot Sound Source Distance Estimation Using Relation Networks

In this paper, we study the performance of few-shot learning, specifical...
research
07/14/2022

Few-shot bioacoustic event detection at the DCASE 2022 challenge

Few-shot sound event detection is the task of detecting sound events, de...

Please sign up or login with your details

Forgot password? Click here to reset