Discriminatory and orthogonal feature learning for noise robust keyword spotting

10/20/2022
by   Donghyeon Kim, et al.
0

Keyword Spotting (KWS) is an essential component in a smart device for alerting the system when a user prompts it with a command. As these devices are typically constrained by computational and energy resources, the KWS model should be designed with a small footprint. In our previous work, we developed lightweight dynamic filters which extract a robust feature map within a noisy environment. The learning variables of the dynamic filter are jointly optimized with KWS weights by using Cross-Entropy (CE) loss. CE loss alone, however, is not sufficient for high performance when the SNR is low. In order to train the network for more robust performance in noisy environments, we introduce the LOw Variant Orthogonal (LOVO) loss. The LOVO loss is composed of a triplet loss applied on the output of the dynamic filter, a spectral norm-based orthogonal loss, and an inner class distance loss applied in the KWS model. These losses are particularly useful in encouraging the network to extract discriminatory features in unseen noise environments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2022

Efficient dynamic filter for robust and low computational feature extraction

Unseen noise signal which is not considered in a model training process ...
research
12/08/2022

Logit Clipping for Robust Learning against Label Noise

In the presence of noisy labels, designing robust loss functions is crit...
research
11/19/2022

Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise

In the context of keyword spotting (KWS), the replacement of handcrafted...
research
01/15/2022

ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting

Building efficient architecture in neural speech processing is paramount...
research
03/26/2019

Multiscale CNN based Deep Metric Learning for Bioacoustic Classification: Overcoming Training Data Scarcity Using Dynamic Triplet Loss

This paper proposes multiscale convolutional neural network (CNN)-based ...
research
06/03/2023

Few-Shot Open-Set Learning for On-Device Customization of KeyWord Spotting Systems

A personalized KeyWord Spotting (KWS) pipeline typically requires the tr...
research
09/15/2021

Behavior of Keyword Spotting Networks Under Noisy Conditions

Keyword spotting (KWS) is becoming a ubiquitous need with the advancemen...

Please sign up or login with your details

Forgot password? Click here to reset