AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data

07/13/2021
by   Menglong Xu, et al.
0

Deep neural networks provide effective solutions to small-footprint keyword spotting (KWS). However, if training data is limited, it remains challenging to achieve robust and highly accurate KWS in real-world scenarios where unseen sounds that are out of the training data are frequently encountered. Most conventional methods aim to maximize the classification accuracy on the training set, without taking the unseen sounds into account. To enhance the robustness of the deep neural networks based KWS, in this paper, we introduce a new loss function, named the maximization of the area under the receiver-operating-characteristic curve (AUC). The proposed method not only maximizes the classification accuracy of keywords on the closed training set, but also maximizes the AUC score for optimizing the performance of non-keyword segments detection. Experimental results on the Google Speech Commands dataset v1 and v2 show that our method achieves new state-of-the-art performance in terms of most evaluation metrics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2020

Metric Learning for Keyword Spotting

The goal of this work is to train effective representations for keyword ...
research
10/28/2017

Deep Residual Learning for Small-Footprint Keyword Spotting

We explore the application of deep residual learning and dilated convolu...
research
12/11/2019

Small-footprint Keyword Spotting with Graph Convolutional Network

Despite the recent successes of deep neural networks, it remains challen...
research
10/27/2021

Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data

Area under the ROC curve (AUC) optimisation techniques developed for neu...
research
08/09/2023

Expert load matters: operating networks at high accuracy and low manual effort

In human-AI collaboration systems for critical applications, in order to...
research
11/16/2022

PBSM: Backdoor attack against Keyword spotting based on pitch boosting and sound masking

Keyword spotting (KWS) has been widely used in various speech control sc...
research
01/28/2021

The fraud loss for selecting the model complexity in fraud detection

In fraud detection applications, the investigator is typically limited t...

Please sign up or login with your details

Forgot password? Click here to reset