Sampling strategies in Siamese Networks for unsupervised speech representation learning

04/30/2018
by   Rachid Riad, et al.

Recent studies have investigated siamese network architectures for learning invariant speech representations using same-different side information at the word level. Here we systematically investigate an often-ignored component of siamese networks: the sampling procedure (how pairs of same vs. different tokens are selected). We show that sampling strategies that take into account Zipf's law, the distribution of speakers, and the proportions of same and different pairs of words significantly impact the performance of the network. In particular, we show that word frequency compression improves learning across a large range of variations in the number of training pairs. This effect does not apply to the same extent in the fully unsupervised setting, where the pairs of same-different words are obtained by spoken term discovery. We apply these results to pairs of words discovered using an unsupervised algorithm and show an improvement over the state of the art in unsupervised representation learning using siamese networks.
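As an illustration of what a frequency-compressed sampling procedure might look like, here is a minimal sketch. It is not the authors' implementation: the abstract does not state the compression function, so the power-law form (count raised to `power`), the function names `compressed_type_weights` and `sample_pairs`, and the `p_same` parameter are all assumptions made for this example. The speaker distribution, which the abstract also mentions, is ignored here.

```python
import random
from collections import Counter, defaultdict


def compressed_type_weights(labels, power=0.5):
    """Weight each word type by count**power (assumed compression form).

    power=1 keeps the raw (Zipfian) frequencies; power=0 makes all
    word types equally likely.
    """
    counts = Counter(labels)
    return {word: count ** power for word, count in counts.items()}


def sample_pairs(labels, n_pairs, p_same=0.5, power=0.5, seed=0):
    """Sample (i, j, same) token-index pairs for siamese training.

    labels[i] is the word type of token i. p_same is the proportion of
    "same" pairs; word types are drawn with compressed frequencies.
    """
    rng = random.Random(seed)
    by_type = defaultdict(list)
    for idx, word in enumerate(labels):
        by_type[word].append(idx)

    weights = compressed_type_weights(labels, power)
    all_types = list(by_type)
    all_w = [weights[w] for w in all_types]
    # Only types with at least two tokens can form a "same" pair.
    same_types = [w for w in all_types if len(by_type[w]) >= 2]
    same_w = [weights[w] for w in same_types]
    if len(all_types) < 2 or not same_types:
        raise ValueError("need >= 2 word types and one type with >= 2 tokens")

    pairs = []
    for _ in range(n_pairs):
        if rng.random() < p_same:
            word = rng.choices(same_types, weights=same_w, k=1)[0]
            i, j = rng.sample(by_type[word], 2)  # two distinct tokens
            pairs.append((i, j, 1))
        else:
            w1 = rng.choices(all_types, weights=all_w, k=1)[0]
            w2 = w1
            while w2 == w1:  # redraw until the two types differ
                w2 = rng.choices(all_types, weights=all_w, k=1)[0]
            pairs.append((rng.choice(by_type[w1]), rng.choice(by_type[w2]), 0))
    return pairs


labels = ["the"] * 8 + ["cat"] * 2 + ["dog"] * 2
pairs = sample_pairs(labels, n_pairs=50, p_same=0.5, power=0.5)
```

Under this assumed form, lowering `power` shifts sampling mass from very frequent word types toward rarer ones, which is the kind of knob the abstract refers to as word frequency compression.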


research
11/28/2020

Unsupervised Spoken Term Discovery Based on Re-clustering of Hypothesized Speech Segments with Siamese and Triplet Networks

Spoken term discovery from untranscribed speech audio could be achieved ...
research
02/07/2022

Crafting Better Contrastive Views for Siamese Representation Learning

Recent self-supervised contrastive learning methods greatly benefit from...
research
11/20/2020

Exploring Simple Siamese Representation Learning

Siamese networks have become a common structure in various recent models...
research
01/03/2020

Deep Unsupervised Common Representation Learning for LiDAR and Camera Data using Double Siamese Networks

Domain gaps of sensor modalities pose a challenge for the design of auto...
research
12/29/2015

Common Variable Learning and Invariant Representation Learning using Siamese Neural Networks

We consider the statistical problem of learning common source of variabi...
research
01/30/2023

Exploring Image Augmentations for Siamese Representation Learning with Chest X-Rays

Image augmentations are quintessential for effective visual representati...
research
04/01/2022

On the Importance of Asymmetry for Siamese Representation Learning

Many recent self-supervised frameworks for visual representation learnin...
