I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization

09/14/2022
by   Dianwen Ng, et al.
0

Noise robustness in keyword spotting remains a challenge as many models fail to overcome the heavy influence of noises, causing the deterioration of the quality of feature embeddings. We proposed a contrastive regularization method called Inter-Intra Contrastive Regularization (I2CR) to improve the feature representations by guiding the model to learn the fundamental speech information specific to the cluster. This involves maximizing the similarity across Intra and Inter samples of the same class. As a result, it pulls the instances closer to more generalized representations that form more prominent clusters and reduces the adverse impact of noises. We show that our method provides consistent improvements in accuracy over different backbone model architectures under different noise environments. We also demonstrate that our proposed framework has improved the accuracy of unseen out-of-domain noises and unseen variant noise SNRs. This indicates the significance of our work with the overall refinement in noise robustness.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2022

Cluster-based Contrastive Disentangling for Generalized Zero-Shot Learning

Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and un...
research
07/02/2021

How Incomplete is Contrastive Learning? An Inter-intra Variant Dual Representation Method for Self-supervised Video Recognition

Contrastive learning applied to self-supervised representation learning ...
research
08/06/2020

Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework

We propose a self-supervised method to learn feature representations fro...
research
02/28/2023

deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition

Existing self-supervised pre-trained speech models have offered an effec...
research
01/27/2022

Contrastive Embedding Distribution Refinement and Entropy-Aware Attention for 3D Point Cloud Classification

Learning a powerful representation from point clouds is a fundamental an...
research
05/03/2022

Efficient dynamic filter for robust and low computational feature extraction

Unseen noise signal which is not considered in a model training process ...
research
12/31/2021

Disjoint Contrastive Regression Learning for Multi-Sourced Annotations

Large-scale datasets are important for the development of deep learning ...

Please sign up or login with your details

Forgot password? Click here to reset