Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks

02/25/2020
by   Théodore Bluche, et al.
0

We explore a keyword-based spoken language understanding system, in which the intent of the user can directly be derived from the detection of a sequence of keywords in the query. In this paper, we focus on an open-vocabulary keyword spotting method, allowing the user to define their own keywords without having to retrain the whole model. We describe the different design choices leading to a fast and small-footprint system, able to run on tiny devices, for any arbitrary set of user-defined keywords, without training data specific to those keywords. The model, based on a quantized long short-term memory (LSTM) neural network, trained with connectionist temporal classification (CTC), weighs less than 500KB. Our approach takes advantage of some properties of the predictions of CTC-trained networks to calibrate the confidence scores and implement a fast detection algorithm. The proposed system outperforms a standard keyword-filler model approach.

READ FULL TEXT

page 1

page 11

page 12

research
12/16/2019

Predicting detection filters for small footprint open-vocabulary keyword spotting

In many scenarios, detecting keywords from natural language queries is s...
research
10/11/2019

Query-by-example on-device keyword spotting

A keyword spotting (KWS) system determines the existence of, usually pre...
research
03/10/2016

Personalized Speech recognition on mobile devices

We describe a large vocabulary speech recognition system that is accurat...
research
06/23/2022

QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer

Current keyword spotting systems are typically trained with a large amou...
research
11/26/2018

DONUT: CTC-based Query-by-Example Keyword Spotting

Keyword spotting--or wakeword detection--is an essential feature for han...
research
07/22/2019

Towards an LSTM-based Predictive Framework for Literature-based Knowledge Discovery

Literature-based knowledge discovery process identifies the important bu...
research
10/18/2021

VRM-Phase I VKW system description of long-short video customizable keyword wakeup challenge

Keyword wakeup technology has always been a research hotspot in speech p...

Please sign up or login with your details

Forgot password? Click here to reset