BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

03/11/2021
by Daisuke Niizumi, et al.

Inspired by recent progress in self-supervised learning for computer vision, where supervision is generated through data augmentations, we explore a new approach to general-purpose audio representation learning. We propose learning general-purpose audio representations from a single audio segment, without assuming any relationship between different time segments of an audio sample. To implement this principle, we introduce Bootstrap Your Own Latent (BYOL) for Audio (BYOL-A, pronounced "viola"), an audio self-supervised learning method based on BYOL. Unlike most previous audio self-supervised learning methods, which rely on agreement between neighboring audio segments or disagreement between distant ones, BYOL-A creates contrasts within a pair of augmented views derived from a single audio segment. With a combination of normalization and augmentation techniques, BYOL-A achieves state-of-the-art results in various downstream tasks. Extensive ablation studies also clarify the contribution of each component and of their combinations.
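The single-segment idea can be illustrated with a minimal sketch. It assumes a log-mel spectrogram as input; the shapes, the function names, and the augmentations used here (a random circular time shift and a random gain) are hypothetical placeholders, not the augmentations used by the authors. The loss is the standard BYOL objective, the mean squared error between L2-normalized vectors, which equals 2 - 2 * cosine similarity, and the target update is the usual exponential moving average.

```python
import numpy as np

def normalize(segment, eps=1e-8):
    # Standardize the input segment (placeholder for the paper's normalization).
    return (segment - segment.mean()) / (segment.std() + eps)

def random_view(segment, rng):
    # Hypothetical augmentation: random circular time shift plus random gain.
    shift = int(rng.integers(0, segment.shape[1]))
    gain = rng.uniform(0.8, 1.2)
    return gain * np.roll(segment, shift, axis=1)

def make_view_pair(segment, rng):
    # Both views come from the SAME segment; no other time segment is involved.
    x = normalize(segment)
    return random_view(x, rng), random_view(x, rng)

def byol_loss(prediction, target):
    # BYOL loss: MSE between L2-normalized vectors = 2 - 2 * cosine similarity.
    p = prediction / np.linalg.norm(prediction, axis=-1, keepdims=True)
    z = target / np.linalg.norm(target, axis=-1, keepdims=True)
    return 2.0 - 2.0 * np.sum(p * z, axis=-1)

def ema_update(target_w, online_w, tau=0.99):
    # Target network weights track the online weights as a moving average.
    return tau * target_w + (1.0 - tau) * online_w

rng = np.random.default_rng(0)
segment = rng.normal(size=(64, 96))  # assumed shape: 64 mel bins x 96 frames
view_1, view_2 = make_view_pair(segment, rng)
```

In a full training loop, an online network would encode one view and predict the target network's embedding of the other view, with `byol_loss` applied symmetrically and `ema_update` applied to the target weights after each step. The encoders are omitted here to keep the sketch short.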

research
09/28/2022

Audio Barlow Twins: Self-Supervised Audio Representation Learning

The Barlow Twins self-supervised learning objective requires neither neg...
research
03/25/2022

DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning

Inspired by the recent progress in self-supervised learning for computer...
research
05/24/2019

Self-supervised audio representation learning for mobile devices

We explore self-supervised models that can be potentially deployed on mo...
research
12/08/2020

I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch

Growing research demonstrates that synthetic failure modes imply poor ge...
research
10/21/2020

Contrastive Learning of General-Purpose Audio Representations

We introduce COLA, a self-supervised pre-training approach for learning ...
research
07/11/2023

Self-Supervised Learning with Lie Symmetries for Partial Differential Equations

Machine learning for differential equations paves the way for computatio...
research
03/15/2023

Enhancing Unsupervised Audio Representation Learning via Adversarial Sample Generation

Existing audio analysis methods generally first transform the audio stre...
