Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise

11/19/2022
by   Iván López-Espejo, et al.
0

In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnable features has not yielded superior KWS performance. In this study, we demonstrate that filterbank learning outperforms handcrafted speech features for KWS whenever the number of filterbank channels is severely decreased. Reducing the number of channels might yield certain KWS performance drop, but also a substantial energy consumption reduction, which is key when deploying common always-on KWS on low-resource devices. Experimental results on a noisy version of the Google Speech Commands Dataset show that filterbank learning adapts to noise characteristics to provide a higher degree of robustness to noise, especially when dropout is integrated. Thus, switching from typically used 40-channel log-Mel features to 8-channel learned features leads to a relative KWS accuracy loss of only 3.5 achieving a 6.3x energy consumption reduction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2023

Contrastive Speech Mixup for Low-resource Keyword Spotting

Most of the existing neural-based models for keyword spotting (KWS) in s...
research
04/11/2022

Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness

It is critical for a keyword spotting model to have a small footprint as...
research
05/30/2020

Exploring Filterbank Learning for Keyword Spotting

Despite their great performance over the years, handcrafted speech featu...
research
10/20/2022

Discriminatory and orthogonal feature learning for noise robust keyword spotting

Keyword Spotting (KWS) is an essential component in a smart device for a...
research
02/04/2022

A Fast Network Exploration Strategy to Profile Low Energy Consumption for Keyword Spotting

Keyword Spotting nowadays is an integral part of speech-oriented user in...
research
05/21/2023

DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting

Real-world complex acoustic environments especially the ones with a low ...
research
05/16/2021

Zero Aware Configurable Data Encoding by Skipping Transfer for Error Resilient Applications

In this paper, we propose Zero Aware Configurable Data Encoding by Skipp...

Please sign up or login with your details

Forgot password? Click here to reset