Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining

02/16/2021
by   Swapnil Bhosale, et al.
0

Few-shot learning aims to generalize unseen classes that appear during testing but are unavailable during training. Prototypical networks incorporate few-shot metric learning, by constructing a class prototype in the form of a mean vector of the embedded support points within a class. The performance of prototypical networks in extreme few-shot scenarios (like one-shot) degrades drastically, mainly due to the desuetude of variations within the clusters while constructing prototypes. In this paper, we propose to replace the typical prototypical loss function with an Episodic Triplet Mining (ETM) technique. The conventional triplet selection leads to overfitting, because of all possible combinations being used during training. We incorporate episodic training for mining the semi hard positive and the semi hard negative triplets to overcome the overfitting. We also propose an adaptation to make use of unlabeled training samples for better modeling. Experimenting on two different audio processing tasks, namely speaker recognition and audio event detection; show improved performances and hence the efficacy of ETM over the prototypical loss function and other meta-learning frameworks. Further, we show improved performances when unlabeled training samples are used.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2019

Constellation Loss: Improving the efficiency of deep metric learning loss functions for optimal embedding

Metric learning has become an attractive field for research on the lates...
research
04/19/2018

Deep Triplet Ranking Networks for One-Shot Recognition

Despite the breakthroughs achieved by deep learning models in convention...
research
04/24/2022

Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention

Although few-shot learning has attracted much attention from the fields ...
research
07/10/2020

Batch-Incremental Triplet Sampling for Training Triplet Networks Using Bayesian Updating Theorem

Variants of Triplet networks are robust entities for learning a discrimi...
research
05/19/2017

Quadruplet Network with One-Shot Learning for Visual Tracking

As a discriminative method of one-shot learning, Siamese deep network al...
research
01/20/2021

Few-shot Action Recognition with Prototype-centered Attentive Learning

Few-shot action recognition aims to recognize action classes with few tr...

Please sign up or login with your details

Forgot password? Click here to reset