Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data

01/31/2022
by   Amir Shirian, et al.

Large-scale databases with high-quality manual annotations are scarce in the audio domain. We thus explore a self-supervised graph approach to learning audio representations from highly limited labelled data. Considering each audio sample as a graph node, we propose a subgraph-based framework with novel self-supervision tasks that can learn effective audio representations. During training, subgraphs are constructed by sampling the entire pool of available training data to exploit the relationship between the labelled and unlabelled audio samples. During inference, we use random edges to alleviate the overhead of graph construction. We evaluate our model on three benchmark audio databases and two tasks: acoustic event detection and speech emotion recognition. Our semi-supervised model performs better than or on par with fully supervised models and outperforms several competitive existing models. Our model is compact (240k parameters) and can produce generalized audio representations that are robust to different types of signal noise.
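
The subgraph idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state subgraph sizes, the number of edges per node, or how edges are built during training, so the function names (`sample_subgraph`, `random_edges`), the pool sizes, and the parameters below are all hypothetical. The sketch only shows the general shape of sampling nodes from both the labelled and unlabelled pools and then wiring them with random edges, as described for inference.

```python
import random


def sample_subgraph(labelled_ids, unlabelled_ids, n_labelled, n_unlabelled, rng):
    """Draw node ids for one subgraph from both pools (sizes are assumptions)."""
    nodes = rng.sample(labelled_ids, n_labelled) + rng.sample(unlabelled_ids, n_unlabelled)
    rng.shuffle(nodes)
    return nodes


def random_edges(nodes, edges_per_node, rng):
    """Link every node to a few randomly chosen other nodes (undirected edges)."""
    edges = set()
    for u in nodes:
        others = [v for v in nodes if v != u]
        for v in rng.sample(others, min(edges_per_node, len(others))):
            # Store each undirected edge once, smaller id first.
            edges.add((min(u, v), max(u, v)))
    return sorted(edges)


rng = random.Random(0)               # fixed seed so the sketch is repeatable
labelled = list(range(0, 10))        # hypothetical ids of labelled samples
unlabelled = list(range(10, 100))    # hypothetical ids of unlabelled samples

nodes = sample_subgraph(labelled, unlabelled, n_labelled=2, n_unlabelled=6, rng=rng)
edges = random_edges(nodes, edges_per_node=2, rng=rng)
print(len(nodes), "nodes,", len(edges), "edges")
```

Random wiring needs no pairwise similarity computation between samples, which is consistent with the abstract's remark that random edges alleviate the overhead of graph construction.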


