Unsupervised Contrastive Learning of Sound Event Representations

11/15/2020
by Eduardo Fonseca, et al.

Self-supervised representation learning can mitigate the limitations of recognition tasks that have little manually labeled data but abundant unlabeled data, a common scenario in sound event research. In this work, we explore unsupervised contrastive learning as a way to learn sound event representations. To this end, we propose the pretext task of contrasting differently augmented views of sound events. The views are computed primarily by mixing training examples with unrelated backgrounds, followed by other data augmentations. We analyze the main components of our method via ablation experiments. We evaluate the learned representations using linear evaluation and in two in-domain downstream sound event classification tasks: one using limited manually labeled data, and one using noisy labeled data. Our results suggest that unsupervised contrastive pre-training can mitigate the impact of data scarcity and increase robustness against noisy labels, outperforming supervised baselines.
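
The pretext task described above can be illustrated with a minimal sketch. The abstract does not give the mixing rule or the loss function, so everything here is an assumption: `mix_with_background` is a hypothetical helper that linearly blends a patch with a background rescaled to the patch's energy, and `nt_xent_loss` is the normalized temperature-scaled cross-entropy loss commonly used in contrastive learning, not necessarily the exact objective of this work. The names, the `alpha` mixing weight, and the `temperature` value are illustrative only.

```python
import numpy as np


def mix_with_background(patch, background, alpha):
    """Blend a patch with an unrelated background (assumed mixing rule).

    The background is rescaled to the patch's energy so that `alpha`
    controls the mixing proportion regardless of the original levels.
    """
    eps = 1e-8
    scale = np.sqrt((np.sum(patch ** 2) + eps) / (np.sum(background ** 2) + eps))
    return (1.0 - alpha) * patch + alpha * scale * background


def nt_xent_loss(z_a, z_b, temperature=0.2):
    """NT-Xent loss for a batch of paired view embeddings (assumed objective).

    z_a[i] and z_b[i] are embeddings of two views of the same example;
    every other embedding in the batch acts as a negative.
    """
    n = z_a.shape[0]
    z = np.concatenate([z_a, z_b], axis=0)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)  # cosine similarity
    sim = (z @ z.T) / temperature
    np.fill_diagonal(sim, -np.inf)  # exclude self-similarity
    # Row i's positive is row i + n, and vice versa.
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(0, n)])
    # Numerically stable log-softmax over each row.
    row_max = sim.max(axis=1, keepdims=True)
    shifted = sim - row_max
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-log_prob[np.arange(2 * n), pos].mean())


# Toy usage: random arrays stand in for spectrogram patches and embeddings.
rng = np.random.default_rng(0)
patch = rng.normal(size=(96, 64))
background = rng.normal(size=(96, 64))
view = mix_with_background(patch, background, alpha=rng.uniform(0.0, 0.05))

emb = rng.normal(size=(8, 16))
loss_aligned = nt_xent_loss(emb, emb + 0.01 * rng.normal(size=(8, 16)))
loss_random = nt_xent_loss(emb, rng.normal(size=(8, 16)))
```

In this toy setup, pairs of nearly identical embeddings should give a lower loss than unrelated pairs, which is the signal an encoder would be trained to maximize. In a real pipeline the embeddings would come from an encoder applied to two differently augmented views of each example.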


research
12/21/2021

Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations

Improving generalization is a major challenge in audio classification du...
research
09/16/2020

Evaluating Self-Supervised Pretraining Without Using Labels

A common practice in unsupervised representation learning is to use labe...
research
06/20/2020

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

We show for the first time that learning powerful representations from s...
research
08/17/2021

MVCNet: Multiview Contrastive Network for Unsupervised Representation Learning for 3D CT Lesions

Objective and Impact Statement. With the renaissance of deep learning, a...
research
08/04/2022

Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations

Sound is one of the most informative and abundant modalities in the real...
research
06/30/2020

A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition

An important problem in machine auditory perception is to recognize and ...
research
11/17/2022

Balanced Deep CCA for Bird Vocalization Detection

Event detection improves when events are captured by two different modal...
