Teaching keyword spotters to spot new keywords with limited examples

06/04/2021
by Abhijeet Awasthi, et al.

Learning to recognize new keywords from just a few examples is essential for personalizing keyword spotting (KWS) models to a user's choice of keywords. However, modern KWS models are typically trained on large datasets and restricted to a small vocabulary of keywords, which limits their transferability to a broad range of unseen keywords. As a step toward easily customizable KWS models, we present KeySEM (Keyword Speech EMbedding), a speech embedding model pre-trained on the task of recognizing a large number of keywords. Speech representations offered by KeySEM are highly effective for learning new keywords from a limited number of examples. Comparisons with a diverse range of related work across several datasets show that our method achieves consistently superior performance with fewer training examples. Although KeySEM was pre-trained only on English utterances, the performance gains also extend to datasets from four other languages, indicating that KeySEM learns useful representations that are well aligned with the task of keyword spotting. Finally, we demonstrate KeySEM's ability to learn new keywords sequentially without re-training on previously learned keywords. Our experimental observations suggest that KeySEM is well suited to on-device environments, where post-deployment learning and ease of customization are often desirable.
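
To make the idea of few-shot and sequential keyword learning on top of a frozen embedding model concrete, here is a minimal sketch. It is not the method from the paper: it assumes that fixed-length embedding vectors have already been produced by some pre-trained speech embedding model, and it swaps in a simple nearest-prototype classifier (mean embedding per keyword, cosine similarity at inference). The class name `PrototypeKeywordClassifier`, its methods, and the toy three-dimensional vectors are all hypothetical and chosen only for illustration.

```python
import math
from typing import Dict, List, Sequence

Vector = List[float]


def l2_normalize(v: Sequence[float]) -> Vector:
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


class PrototypeKeywordClassifier:
    """Hypothetical few-shot classifier over frozen embeddings (an assumption,
    not the paper's classifier): one prototype vector per keyword."""

    def __init__(self) -> None:
        self.prototypes: Dict[str, Vector] = {}

    def add_keyword(self, keyword: str, examples: Sequence[Sequence[float]]) -> None:
        """Register a keyword from a handful of example embeddings.

        Only the new keyword's prototype is written, so previously learned
        keywords are left untouched and need no re-training.
        """
        if not examples:
            raise ValueError("need at least one example embedding")
        normalized = [l2_normalize(e) for e in examples]
        dim = len(normalized[0])
        mean = [sum(e[i] for e in normalized) / len(normalized) for i in range(dim)]
        self.prototypes[keyword] = l2_normalize(mean)

    def predict(self, embedding: Sequence[float]) -> str:
        """Return the keyword whose prototype is most cosine-similar to the query."""
        if not self.prototypes:
            raise ValueError("no keywords registered")
        query = l2_normalize(embedding)

        def score(keyword: str) -> float:
            return sum(q * p for q, p in zip(query, self.prototypes[keyword]))

        return max(self.prototypes, key=score)


# Toy embeddings standing in for the output of a pre-trained embedding model.
clf = PrototypeKeywordClassifier()
clf.add_keyword("lights", [[0.9, 0.1, 0.0], [1.0, 0.0, 0.1]])
clf.add_keyword("music", [[0.0, 1.0, 0.1], [0.1, 0.9, 0.0]])

# Later, a third keyword is added from a single example.
clf.add_keyword("timer", [[0.0, 0.1, 1.0]])

print(clf.predict([0.8, 0.2, 0.1]))
print(clf.predict([0.1, 0.2, 0.9]))
```

Because each keyword is stored as an independent prototype, adding "timer" does not modify the entries for "lights" or "music". A real system would also need a rejection threshold for utterances that match no registered keyword, which this sketch omits.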


